Petascale computing

From Wikipedia, the free encyclopedia

Petascale computing refers to computing systems capable of performing at least 1 quadrillion (10^15) floating-point operations per second (FLOPS), that is, one petaFLOPS. Such systems are often called petaflops systems. Their raw performance is a thousandfold increase over terascale (10^12 FLOPS) systems, which enables them to handle vast datasets and complex computations.

Definition

Floating-point operations per second (FLOPS) are one measure of computer performance. FLOPS can be recorded at different levels of precision; the standard measure, used by the TOP500 supercomputer list, counts 64-bit (double-precision floating-point format) operations per second on the High Performance LINPACK (HPLinpack) benchmark.[1][2]

The metric typically refers to single computing systems, although it can also be applied to distributed computing systems for comparison. Alternative precision measures based on the LINPACK benchmarks exist, but they are not part of the standard metric or definition.[2] HPLinpack has been recognized as a possibly poor general measure of supercomputer utility in real-world applications; it nevertheless remains the common standard for performance measurement.[3][4]
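As a rough illustration of how a sustained FLOPS figure relates to the petascale threshold, the following Python sketch converts a timed dense linear solve into an operation rate and classifies it. It is a minimal sketch, not the TOP500 or HPLinpack implementation: the function names and the sample problem size and run time are hypothetical, and the operation count (2/3)n^3 + (3/2)n^2 is the conventional figure for a dense LU-based solve, assumed here.

```python
# Illustrative sketch only: these names are hypothetical and are not part of
# TOP500 tooling or any HPLinpack distribution.

PETA = 10**15  # 1 petaFLOPS, the petascale threshold
EXA = 10**18   # 1 exaFLOPS, the exascale threshold


def dense_solve_flop_count(n: int) -> float:
    """Assumed operation count for solving a dense n-by-n linear system
    via LU factorization: (2/3) n^3 + (3/2) n^2."""
    return 2 * n**3 / 3 + 3 * n**2 / 2


def sustained_flops(n: int, seconds: float) -> float:
    """Operations per second achieved when the solve takes `seconds`."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return dense_solve_flop_count(n) / seconds


def scale_of(flops: float) -> str:
    """Classify a sustained rate by order of magnitude."""
    if flops >= EXA:
        return "exascale"
    if flops >= PETA:
        return "petascale"
    return "sub-petascale"


if __name__ == "__main__":
    # Hypothetical run: one million unknowns solved in 600 seconds.
    rate = sustained_flops(1_000_000, 600.0)
    print(f"{rate / PETA:.3f} petaFLOPS -> {scale_of(rate)}")
```

The classification compares only raw rates against the 10^15 and 10^18 thresholds; it says nothing about precision or about how representative the benchmark is of real workloads.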

History

The petaFLOPS barrier was first broken on 16 September 2007 by the distributed computing Folding@home project.[5] The first single petascale system, the Roadrunner, entered operation in 2008.[6] The Roadrunner, built by IBM, had a sustained performance of 1.026 petaFLOPS. The Jaguar became the second computer to break the petaFLOPS milestone, later in 2008, and reached a performance of 1.759 petaFLOPS after a 2009 update.[7]

In June 2020, Fugaku became the fastest supercomputer in the world, reaching 415 petaFLOPS. It later achieved an Rmax of 442 petaFLOPS in November of the same year.

Exascale computing was reached in 2022 with Frontier, which surpassed Fugaku with an Rmax of 1.102 exaFLOPS in June 2022.[8]

Artificial intelligence

Modern artificial intelligence (AI) systems require large amounts of computational power to train model parameters. OpenAI employed 25,000 Nvidia A100 GPUs to train GPT-4, using 133 septillion (about 1.33 × 10^26) floating-point operations.[9]

References

  1. "FREQUENTLY ASKED QUESTIONS". www.top500.org. Retrieved 23 June 2020.
  2. Kogge, Peter, ed. (1 May 2008). ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems (PDF). United States Government. Retrieved 28 September 2008.
  3. Bourzac, Katherine (November 2017). "Supercomputing poised for a massive speed boost". Nature. 551 (7682): 554–556. doi:10.1038/d41586-017-07523-y. Retrieved 3 June 2022.
  4. Reed, Daniel; Dongarra, Jack. "Exascale Computing and Big Data: The Next Frontier" (PDF). Retrieved 3 June 2022.
  5. Gross, Michael (2012). "Folding research recruits unconventional help". Current Biology. 22 (2): R35–R38. doi:10.1016/j.cub.2012.01.008. PMID 22389910.
  6. National Research Council (U.S.) (2008). The potential impact of high-end capability computing on four illustrative fields of science and engineering. The National Academies. p. 11. ISBN 978-0-309-12485-0.
  7. National Center for Computational Sciences (NCCS) (2010). "World's Most Powerful Supercomputer for Science!". NCCS. Archived from the original on 27 November 2009. Retrieved 26 June 2010.
  8. "June 2022 | TOP500". www.top500.org. Retrieved 21 November 2024.
  9. Minde, Tor Björn (8 October 2023). "Generative AI does not run on thin air". RISE. Retrieved 29 March 2024.