CACR Research Publications » Posts for tag 'high-performance computing'

A (Hypothetical) Data to Discovery Engine (v2)

Mark Stalzer, Caltech – CACR

Seminar given at “IST Lunch Bunch”
Tuesday November 2, 2010
12:00 PM
105 Annenberg

Abstract:
Moore’s law works for semiconductor-based detectors and there is an increasing flood of data being generated in astronomy, high energy physics, biology, and other sciences. Computation is essential for both (1) making predictions from theory and (2) the analysis of data from experiments. Is there a way to architecturally balance both needs in a high performance computing (HPC) system?

HPC systems are typically constructed from easily available parts, just organized differently and scaled to process extreme workloads. The most power efficient petascale machine as of 2010 is Roadrunner at the Los Alamos National Laboratory. Another interesting machine is the Apple iPad which uses very low power System on a Chip/Package on Package technologies. This talk explores the question of “what happens when you cross Roadrunner with iPads”? The result is a high level of integration between computation and storage on a single server blade, called a Flashblade, with 100x-1,000x performance improvements for some data-centric applications. The Flashblade architecture, expected performance, programming, and scaling with advancing technology are discussed.

(PDF)

A (Hypothetical) Data to Discovery Engine

AstroInformatics 2010
Caltech, June 16-19, 2010

Mark Stalzer

High performance computing systems are typically constructed from easily available parts, just organized differently and scaled to process extreme workloads. The most power efficient petascale machine as of 2010 is Roadrunner at the Los Alamos National Laboratory. Another interesting machine is the Apple iPad which uses very low power System on a Chip/Package on Package technologies. This talk explores the question of “what happens when you cross Roadrunner with iPads”?

(PDF)

Applications Architecture Power Puzzle – SC09 Panel

Click names for presentation PDF.

Moderator:

Panelists:

  • Thomas Sterling, Louisiana State University
  • Allan Snavely, San Diego Supercomputer Center
  • Stephen Poole, Oak Ridge National Laboratory
  • William Camp, Intel Corporation

Abstract:
Over the next few years there are two boundary conditions that should constrain computer systems architecture: commodity components and applications performance. Yet, these two seem strangely disconnected. Perhaps we need some human optimization, as opposed to repeated use of Moore’s Law. Our panelists have been given a set of standard components that are on announced vendor roadmaps. They also each get to make one mystery component of no more complexity than a commercially available FPGA. The applications are HPL for linear algebra, Map-Reduce for databases, and a sequence matching algorithm for biology. The panelists have 10 minutes to disclose their systems architecture and mystery component, and estimate performance for the three applications at 1MW of power.

Whole-volume integrated gyrokinetic simulation of plasma turbulence in realistic diverted-tokamak geometry

SciDAC 2009, Journal of Physics: Conference Series. J Phys: Conf Ser 180 (2009) 012057.

C S Chang, S Ku, P Diamond, M Adams, R Barreto, Y Chen, J Cummings, E D’Azevedo, G Dif-Pradalier, S Ethier, L Greengard, T S Hahm, F Hinton, D Keyes, S Klasky, Z Lin, J Lofstead, G Park, S Parker, N Podhorszki, K Schwan, A Shoshani, D Silver, M Wolf, P Worley, H Weitzner, E Yoon and D Zorin

Scaling to 150K cores: recent algorithm and performance engineering developments enabling XGC1 to run at scale

SciDAC 2009, Journal of Physics: Conference Series. J Phys: Conf Ser 180 (2009) 012036.

M Adams, S Ku, P Worley, E D’Azevedo, J Cummings and C S Chang

Porting Some Key Caltech & JPL Applications to a PS3 Cluster – A Wild Ride

High Performance Embedded Computing Workshop,/ Lexington, MA, September 2008.

Paul Springer, Ed Upchurch, Mark Stalzer, Sean Mauch, John McCorquodale, Jan Lindheim, and Michael Burl

(PDF)

The “MIND” Scalable PIM Architecture

In: Advanced Research Workshop on High Performance Computing Technology and Applications, 31 May-3 June, 2004, Cetraro, Italy. [CaltechCACR:2005.102]

Thomas Sterling and Maciej Brodowicz

(PDF )

Reusable and Extensible High Level Data Distributions

In: Workshop on Patterns in High Performance Computing, 4-6 May, 2005, University of Illinois at Urbana-Champaign.

Roxana E. Diaconescu, Bradford Chamberlain and Hans P. Zima (2005)

(PDF)

Continuum Computer Architecture for Nano-scale and Ultra-high Clock Rate Technologies

In: International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems, January 17-19, 2005, Oahu, Hawaii. [CaltechCACR:2005.101]

Thomas Sterling and Maciej Brodowicz

(PDF)

21st Century Simulation: Exploiting High Performance Computing and Data Analysis

In: The Interservice/Industry Training, Simulation and Education Conference (I/ITSEC), 6-9 December, 2004, Orlando, FL.

Dan M. Davis, Garth D. Baer and Thomas D. Gottschalk

(PDF)