Sage Journals: Discover world-class research

Abstract

We describe a new high-performance conjugate-gradient (HPCG) benchmark. HPCG is composed of computations and data-access patterns commonly found in scientific applications. HPCG strives for a better correlation to existing codes from the computational science domain and to be representative of their performance. HPCG is meant to help drive the computer system design and implementation in directions that will better impact future performance improvement.

Keywords

Preconditioned conjugate gradient multigrid smoothing additive Schwarz HPC benchmarking validation and verification

Get full access to this article

View all access options for this article.

References

Bailey

Barscz

Barton

. (1994) The NAS parallel benchmarks. Technical Report no. RNR-94-007, NASA Ames Research Center, USA.

Bailey

Harris

Saphir

. (1995) The NAS parallel benchmarks 2.0. Techinical Report no. NAS-95-020, NASA Ames Research Center, USA.

Byun

Lin

Yelick

. (2012) Autotuning sparse matrix-vector multiplication for multicore. Technical Report no. UCB/EECS-2012-215, University of California, USA.

Chronopoulos

Gear

(1989) s-Step iterative methods for symmetric linear systems. Journal of Computational and Applied Mathematics 25: 153–168.

D’Azevedo

Eijkhout

Romine

(1993) LAPACK working note 56: Reducing communication costs in the conjugate gradient algorithm on distributed memory multiprocessor. Technical Report no. CS-93-185, University of Tennessee, Knoxville, USA.

der Wijngaart

RFV

(2002) NAS parallel benchmarks version 2.4. Technical Report no. NAS-02-007, Computer Sciences Corporation, NASA Advanced Supercomputing (NAS) Division, USA, October.

Dongarra

Eijkhout

(2003) Finite-choice algorithm optimization inconjugate gradients. Technical Report no. 159, LAPACK Working Note, University of Tennessee, USA.

Dongarra

Eijkhout

van der Vorst

(2001) Iterative solver benchmark. Scientific Programming 9(4): 223–231.

Dongarra

Heroux

(2013) Toward a new metric for ranking high performance computing systems. Technical Report no. SAND2013-4744, Sandia National Laboratories, USA.

10.

Dongarra

Luszczek

Petitet

(2003) The LINPACK benchmark: Past, present, and future. Concurrency and Computation: Practice and Experience 15(9): 803–820.

11.

Eijkhout

(1992) LAPACK working note 51: Qualitative properties of the conjugate gradient and Lanczos methods in a matrix framework. Technical Report no. CS 92-170, University of Tennessee, USA.

12.

ORNL Leadership Computing Facility (2013a) Annual Report 2012–2013, Available at: https://www.olcf.ornl.gov/wp-content/uploads/2014/03/2013_ARv2M.pdf (accessed 10 August 2015).

13.

ORNL Leadership Computing Facility (2013b) Introducing Titan — the world’s #1 open science supercomputer, Available at: http://www.olcf.ornl.gov/titan (accessed 29 May 2013).

14.

Ghysels

Vanroose

(2012) Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm. Technical Report no. 12.2012.1, Intel Labs Europe. Presented at PRECON13, June 19–21, 2013, Oxford, UK.

15.

Heroux

Doerfler

Crozier

. (2009) Improving performance via mini-applications. Technical Report no. SAND2009-5574, Sandia National Laboratories.

16.

Hoefler

Gottschling

Lumsdaine

. (2007) Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations. Elsevier Journal of Parallel Computing 33(9): 624–633.

17.

Yelick

Vuduc

(2004) Sparsity: Optimization framework for sparse matrix kernels. International Journal of High Performance Computing Applications 18(1): 135–158.

18.

Joubert

Kothe

Nam

(2009) Preparing for exascale: ORNL leadership computing facility application requirements and strategy. Technical Report no. ORNL/TM-2009/308, Oak Ridge National Laboratory, USA, December.

19.

Liu

Smelyanskiy

Chow

. (2013) Efficient sparse matrix-vector multiplication on x86-based many-core processors. In: ICS’13, Eugene, OR, 10–14 June 2013.

20.

Luszczek

Dongarra

(2010) Analysis of various scalar, vector, and parallel implementations of RandomAccess. Technical Report no. ICL-UT-10-03, Innovative Computing Laboratory, USA.

21.

Luszczek

Dongarra

Kepner

(2006) Design and implementation of the HPCC benchmark suite. CT Watch Quarterly 2(4): 18–23.

22.

Mattheij

RMM

Rienstra

ten Thije Boonkkamp

JHM

(2005) Partial Differential Equations, Modeling, Analysis, Computation. Philadelphia: SIAM.

23.

Meuer

Strohmaier

Dongarra

. (2013) TOP500 supercomputer sites, 42nd ed. Avaliable from: http://www.netlib.org/benchmark/top500.html (accessed 10 August 2015).

24.

Meurant

(1987) Multitasking the conjugate gradient method on the CRAY X-MP/48. Parallel Computing 5: 267–280.

25.

Saad

(2003) Iterative Methods for Sparse Linear Systems. 2nd ed. Philadelphia, PA: Society for Industrial and Applied Mathematics.

26.

Smith

Bjørstad

Gropp

(1996) Domain Decomposition, Parallel Multilevel Methods for Elliptic Partial Differential Equations. Cambridge, MA: Cambridge University Press.

27.

Trottenberg

Oosterlee

Schüller

(2001) Multigrid. London: Academic Press.

28.

Vuduc

Demmel

Yelick

(2005) OSKI: A library of automatically tuned sparse matrix kernels. In: Proceedings of SciDAC 2005, Journal of Physics: Conference Series, San Francisco, CA, 2005, pp. 51–530. Bristol, UK: IOPscience.

High-performance conjugate-gradient benchmark: A new metric for ranking high-performance computing systems

Abstract

Keywords

Get full access to this article

References