Publications - DataScience

2015

Carr, Hamish; Sewell, Christopher; Lo, Li-Ta; james Ahrens,

Hybrid Data-Parallel Contour Tree Computation Proceedings Article

In: 2015, (LA-UR-15-24759).

Abstract | Links | BibTeX | Tags: and object reppresentations, computational geometry and object modeling, contour tree, data-parallel, gpu, multi-core, nvidia thrust, simulation output analysis, solid, surface, topological analysis

2013

Sewell, Christopher; Lo, Li-ta; Ahrens, James

Portable data-parallel visualization and analysis in distributed memory environments Proceedings Article

In: Large-Scale Data Analysis and Visualization (LDAV), 2013 IEEE Symposium on, pp. 25–33, IEEE 2013, (LA-UR-13-23809).

Abstract | Links | BibTeX | Tags: analysis, Concurrent Programming, data-parallel, distributed memory, parallel programming, PISTON, visualization

@inproceedings{sewell2013portable,

title = {Portable data-parallel visualization and analysis in distributed memory environments},

author = {Christopher Sewell and Li-ta Lo and James Ahrens},

url = {http://datascience.dsscale.org/wp-content/uploads/2016/06/PortableData-ParallelVisualizationAndAnalysisInDistributedMemoryEnvironments.pdf},

year  = {2013},

date = {2013-01-01},

booktitle = {Large-Scale Data Analysis and Visualization (LDAV), 2013 IEEE Symposium on},

pages = {25--33},

organization = {IEEE},

abstract = {Data-parallelism is a programming model that maps well to architectures with a high degree of concurrency. Algorithms written using data-parallel primitives can be easily ported to any architecture for which an implementation of these primitives exists, making efficient use of the available parallelism on each. We have previously published results demonstrating our ability to compile the same data-parallel code for several visualization algorithms onto different on-node parallel architectures (GPUs and multi-core CPUs) using our extension of NVIDIAÕs Thrust library. In this paper, we discuss our extension of Thrust to support concurrency in distributed memory environments across multiple nodes. This enables the application developer to write data-parallel algorithms while viewing the data as single, long vectors, essentially without needing to explicitly take into consideration whether the values are actually distributed across nodes. Our distributed wrapper for Thrust handles the communication in the backend using MPI, while still using the standard Thrust library to take advantage of available on-node parallelism. We describe the details of our distributed implementations of several key data-parallel primitives, including scan, scatter/ gather, sort, reduce, and upper/lower bound. We also present two higher-level distributed algorithms developed using these primitives: isosurface and KD-tree construction. Finally, we provide timing results demonstrating the ability of these algorithms to take advantage of available parallelism on nodes and across multiple nodes, and discuss scaling limitations for communication-intensive algorithms such as KD-tree construction.},

note = {LA-UR-13-23809},

keywords = {analysis, Concurrent Programming, data-parallel, distributed memory, parallel programming, PISTON, visualization},

pubstate = {published},

tppubtype = {inproceedings}

}

2012

Sewell, Christopher; Meredith, Jeremy; Moreland, Kenneth; Peterka, Tom; DeMarle, David; Lo, Li-ta; Ahrens, James; Maynard, Robert; Geveci, Berk

The SDAV software frameworks for visualization and analysis on next-generation multi-core and many-core architectures Proceedings Article

In: High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:, pp. 206–214, IEEE 2012, (LA-UR-12-26928).

Abstract | Links | BibTeX | Tags: data-parallel, in-situ, many-core architectures, mult-core architectures, visualization, VTK-m

2007

McCormick, Patrick; Inman, Jeff; Ahrens, James; Mohd-Yusof, Jamaludin; Roth, Greg; Cummins, Sharen

Scout: a data-parallel programming language for graphics processors Journal Article

In: Parallel Computing, vol. 33, no. 10, pp. 648–662, 2007, (LA-UR-07-2094).

Abstract | Links | BibTeX | Tags: data-parallel, Graphics Systems

1995

Ahrens, James; Hansen, Charles

Cost-effective data-parallel load balancing Proceedings Article

In: ICPP (2), pp. 218–221, 1995, (LA-UR-95-1462).

Abstract | Links | BibTeX | Tags: data-parallel, load balancing

1993

Ortega, Frank; Hansen, Charles; Ahrens, James

Fast Data Parallel Polygon Rendering Proceedings Article

In: Proceedings of the 1993 ACM/IEEE Conference on Supercomputing, pp. 709–718, ACM, Portland, Oregon, USA, 1993, ISBN: 0-8186-4340-4, (LA-UR-93-3173).

Abstract | Links | BibTeX | Tags: data-parallel, polygon rendering

: . .

Carr, Hamish; Sewell, Christopher; Lo, Li-Ta; james Ahrens,

Hybrid Data-Parallel Contour Tree Computation Proceedings Article

In: 2015, (LA-UR-15-24759).

Abstract | Links | BibTeX

Sewell, Christopher; Lo, Li-ta; Ahrens, James

Portable data-parallel visualization and analysis in distributed memory environments Proceedings Article

In: Large-Scale Data Analysis and Visualization (LDAV), 2013 IEEE Symposium on, pp. 25–33, IEEE 2013, (LA-UR-13-23809).

Abstract | Links | BibTeX

@inproceedings{sewell2013portable,

title = {Portable data-parallel visualization and analysis in distributed memory environments},

author = {Christopher Sewell and Li-ta Lo and James Ahrens},

url = {http://datascience.dsscale.org/wp-content/uploads/2016/06/PortableData-ParallelVisualizationAndAnalysisInDistributedMemoryEnvironments.pdf},

year  = {2013},

date = {2013-01-01},

booktitle = {Large-Scale Data Analysis and Visualization (LDAV), 2013 IEEE Symposium on},

pages = {25--33},

organization = {IEEE},

abstract = {Data-parallelism is a programming model that maps well to architectures with a high degree of concurrency. Algorithms written using data-parallel primitives can be easily ported to any architecture for which an implementation of these primitives exists, making efficient use of the available parallelism on each. We have previously published results demonstrating our ability to compile the same data-parallel code for several visualization algorithms onto different on-node parallel architectures (GPUs and multi-core CPUs) using our extension of NVIDIAÕs Thrust library. In this paper, we discuss our extension of Thrust to support concurrency in distributed memory environments across multiple nodes. This enables the application developer to write data-parallel algorithms while viewing the data as single, long vectors, essentially without needing to explicitly take into consideration whether the values are actually distributed across nodes. Our distributed wrapper for Thrust handles the communication in the backend using MPI, while still using the standard Thrust library to take advantage of available on-node parallelism. We describe the details of our distributed implementations of several key data-parallel primitives, including scan, scatter/ gather, sort, reduce, and upper/lower bound. We also present two higher-level distributed algorithms developed using these primitives: isosurface and KD-tree construction. Finally, we provide timing results demonstrating the ability of these algorithms to take advantage of available parallelism on nodes and across multiple nodes, and discuss scaling limitations for communication-intensive algorithms such as KD-tree construction.},

note = {LA-UR-13-23809},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Sewell, Christopher; Meredith, Jeremy; Moreland, Kenneth; Peterka, Tom; DeMarle, David; Lo, Li-ta; Ahrens, James; Maynard, Robert; Geveci, Berk

The SDAV software frameworks for visualization and analysis on next-generation multi-core and many-core architectures Proceedings Article

In: High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:, pp. 206–214, IEEE 2012, (LA-UR-12-26928).

Abstract | Links | BibTeX

McCormick, Patrick; Inman, Jeff; Ahrens, James; Mohd-Yusof, Jamaludin; Roth, Greg; Cummins, Sharen

Scout: a data-parallel programming language for graphics processors Journal Article

In: Parallel Computing, vol. 33, no. 10, pp. 648–662, 2007, (LA-UR-07-2094).

Abstract | Links | BibTeX

Ahrens, James; Hansen, Charles

Cost-effective data-parallel load balancing Proceedings Article

In: ICPP (2), pp. 218–221, 1995, (LA-UR-95-1462).

Abstract | Links | BibTeX

Ortega, Frank; Hansen, Charles; Ahrens, James

Fast Data Parallel Polygon Rendering Proceedings Article

In: Proceedings of the 1993 ACM/IEEE Conference on Supercomputing, pp. 709–718, ACM, Portland, Oregon, USA, 1993, ISBN: 0-8186-4340-4, (LA-UR-93-3173).

Abstract | Links | BibTeX