Jump to content

Cerebras Systems

From Wikipedia, the free encyclopedia

Cerebras Systems Inc.
TypePublic
Industry
Founded2015; 11 years ago (2015)
Founders
  • Andrew Feldman
  • Gary Lauterbach
  • Michael James
  • Sean Lie
  • Jean-Philippe Fricker
HeadquartersSunnyvale, California, US,
Key people
Andrew Feldman (CEO)
Robert Komin (CFO)
Sean Lie (CTO)
Dhiraj Mallick (COO)
ProductsSemiconductors
Supercomputers
APIs
Data centers
RevenueIncrease $510 million (2025)
Increase $87 million (2025)
Total assetsIncrease $2.326 billion (2025)
Total equityIncrease -$578 million (2025)
OwnerBenchmark (17.6 million shares)
Foundation Capital (15.3 million shares)
Eclipse Ventures Fund 1 (13.5 million shares)
Alpha Wave (12.1 million shares)
Andrew D. Feldman (10.1 million shares)
Sean Lie (5.3 million shares)
G42 (3.5 million shares)
Dhiraj Mallick (1.8 million shares)
OpenAI (warrants to purchase 33.4 million shares, subject to purchases of compute capacity by OpenAI)
Number of employees
708 (2025)
Websitecerebras.ai
Footnotes
[1]

Cerebras Systems Inc., headquartered in Sunnyvale, California, develops semiconductors, supercomputers, and related software to power artificial intelligence deep-learning applications such as inference engines. Products include its wafer scale engine (WSE)-3 semiconductors, its CS-3 supercomputers, and its "AI inference cloud" and "AI training cloud" APIs, which allow users to access the company's computing power without buying its hardware. The company also builds data centers using its processors and supercomputers to provide cloud computing services directly to clients.

Measuring 215 mm (8.5 in) squared, the company's WSE-3 semiconductors are currently the largest AI semiconductors ever built. They take up entire silicon wafers and use wafer-scale integration and switched fabric. This reduces latency and interconnect bottlenecks compared to GPU clusters. They use static random-access memory, as opposed to dynamic random-access memory. Cerebras semiconductors and computer systems are much more powerful than those of competitors; however, they have disadvantages due to their large size, 25kW power draw, and cost of as much as $3 million per node.

The company has 4 major customers: the Mohamed bin Zayed University of Artificial Intelligence (62% of 2025 revenues), G42 (24% of 2025 revenues), OpenAI (signed in 2026), and Amazon Web Services (signed in 2026).[1]

Cerebras has offices in Sunnyvale, San Diego, Toronto, and Bangalore, India. Its semiconductors are manufactured by TSMC, currently the only company that has the ability to manufacture Cerebras chips.[1]

The company's primary competitors for its hardware are Nvidia, AMD, Intel, and Broadcom and the company's primary competitors for its cloud computing services are Amazon Web Services, Microsoft Azure, Google Cloud Platform, Oracle Corporation, and CoreWeave.[1]

History

[edit]

Cerebras Systems was founded in 2015 by Andrew Feldman, Gary Lauterbach, Michael James, Sean Lie, and Jean-Philippe Fricker.[2] These five founders worked together at SeaMicro, which was started in 2007 by Feldman and Lauterbach and sold to AMD in 2012 for $334 million.[3]

The founders knew that GPUs were not the optimal semiconductors for high-level processes. However, they had to design unique cooling methods to prevent a "massive" semiconductor from burning when drawing power, unique software to route around usual microscopic manufacturing defects, and they had to invent a machine that could drill 40 screws into the wafer simultaneously without it cracking.[4][5]

The company had difficulty solving the problem of integrated circuit packaging: adhering the silicon to a motherboard, receiving power, and dealing with heating and cooling and the pipes to deliver and return data. It was burning through $8 million per month and spent $200 million trying to solve the problem. In July 2019, after exhaustive trial and error, the company finally produced a product that worked.[5]

In August 2019, Cerebras announced WSE-1, its first-generation Wafer-Scale Engine (WSE) semiconductors and its CS-1 supercomputing system.[6][7][8][9][10] The CS-1 is a 19-inch rack-mounted appliance and includes a single WSE primary processor with 400,000 processing cores, 1.2 trillion transistors (twelve 100-gigabit ethernet connections), and 18 gigabytes of memory.[6][7][8]

The company's first customers were educational institutions and life-sciences companies that were building supercomputers for purposes of drug discovery, computational fluid dynamics, genetic and genomic research, to predict response to drugs, and for COVID-19 research.[1] Early customers included GlaxoSmithKline,[11][12] AstraZeneca,[13] the National Energy Technology Laboratory,[14] Lawrence Livermore National Laboratory,[15] the Pittsburgh Supercomputing Center,[16] and Edinburgh Parallel Computing Centre.[17]

In September 2020, the company opened an office in Japan and partnered with Tokyo Electron.[18]

In April 2021, the company released its CS-2 system, based on the company's Wafer Scale Engine Two (WSE-2), which has 850,000 cores.[19] The CS-2 is manufactured by the 7 nm process of TSMC.[19] It is 26 inches (660 mm) tall and fits in one-third of a standard data center rack.[20][19] The WSE-2 has 850,000 cores and 2.6 trillion transistors.[20][21] It enables a single system to support AI models with more than 120 trillion parameters.[22][23] The WSE-2 expanded on-chip SRAM to 40 gigabytes, memory bandwidth to 20 petabytes per second, and total fabric bandwidth to 220 petabits per second.[12] Customers included TotalEnergies,[24] nference,[25] the National Center for Supercomputing Applications (NCSA),[26] and the Leibniz Supercomputing Centre.[27]

In August 2021, Cerebras announced a partnership with Peptilogics on the development of AI for peptide therapeutics.[28]

In June 2022, Cerebras set a record for the largest AI models ever trained on one device—a single CS-2 system with one Cerebras wafer trained models with up to 20 billion parameters.[29][30] The Cerebras CS-2 system can train multibillion-parameter natural-language-processing (NLP) models including GPT-3XL 1.3 billion models, as well as GPT-J 6B, GPT-3 13B, and GPT-NeoX 20B with reduced software complexity and infrastructure.[30][29]

In August 2022, the Computer History Museum in Mountain View, California unveiled a new display featuring the WSE-2, named "The Biggest Chip In the World".[31][32]

Also in August 2022, Cerebras opened an office in Bangalore, India.[33]

In September 2022, Cerebras announced that it can patch its chips together to create what would be the largest-ever computing cluster for AI computing.[34] A Wafer-Scale Cluster can connect up to 192 CS-2 AI systems into a cluster, while a cluster of 16 CS-2 AI systems can create a computing system with 13.6 million cores for natural-language processing.[34] It uses data parallelism to train.[35]

In October 2022, Sandia National Laboratories of the National Nuclear Security Administration began using the CS-2 in nuclear stockpile stewardship computing, to determine if nuclear weapons will work as intended.[36]

In November 2022, Cerebras unveiled the Andromeda supercomputer, which combines 16 WSE-2 chips into one cluster with 13.5 million AI-optimized cores, delivering up to 1 exaflop of AI computing horsepower, or at least one quintillion (1018) operations per second.[37][38] The entire system consumes 500 kW, which was a drastically lower amount than somewhat-comparable GPU-accelerated supercomputers.[37]

In November 2022, Cerebras announced a partnership with Cirrascale Cloud Services to provide a flat-rate "pay-per-model" compute time for its Cerebras AI Model Studio.[39]

In November 2022, the National Energy Technology Laboratory (NETL) set milestones using Cerebras products.[40]

In November 2022, Argonne National Laboratory won the 2022 Gordon Bell Special Prize for COVID-19 research by using the CS-2 as well as products from Nvidia and Hewlett-Packard to transform large language models to analyze and predict variants of SARS-CoV-2.[41]

In July 2023, G42 agreed to pay around $100 million to purchase the first of potentially nine supercomputers from Cerebras. The first computer, Condor Galaxy 1 (CG-1), is capable of 4 quintillion floating-point operations per second (4 exaflops) of compute and contains 54 million cores.[42][43][44][45] In November 2023, the second computer, the Condor Galaxy 2 (CG-2), was announced, with similar specifications as the CG-1. In March 2024, the companies broke ground on the Condor Galaxy 3 (CG-3), which can reach 8 exaFLOPs of performance and contains 58 million AI-optimized cores.[46]

In August 2023, Cerebras, the Mohamed bin Zayed University of Artificial Intelligence, and G42 subsidiary Inception launched Jais, a large language model.[47]

In March 2024, Cerebras introduced Wafer Scale Engine (WSE-3) architecture, a 5nm-based chip hosting 4 trillion transistors and 900,000 AI-optimized cores, and the basis for the CS-3. It has twice the performance of CS-2. The CS-3 was named to the list of the "best inventions of 2024" by Time.[48]

In May 2024, Cerebras in collaboration with researchers from Sandia National Laboratories, Lawrence Livermore National Laboratory, Los Alamos National Laboratory, and the National Nuclear Security Administration, simulated 800,000 atoms interacting with each other, calculating the interactions in increments of one femtosecond at a time. Each step took just microseconds to compute on the Cerebras WSE-2, much faster than on the Frontier supercomputer.[49]

Cerebras was named to the "100 Most Influential Companies list" by Time in May 2024.[50]

In June 2024, Cerebras announced a collaboration with Dell Technologies, for AI compute infrastructure for generative AI.[51]

In August 2024, Cerebras launched its AI inference service, which it claimed to be ten to twenty times faster than systems built using Nvidia's H100 Hopper graphics processing units.[52]

In January 2025, the Mayo Clinic announced a collaboration with Cerebras to combine genomic data with de-identified data from patient records and medical evidence to explore the ability to predict a patient's response to treatments to manage disease.[53]

In January 2025, Cerebras announced support for DeepSeek's R1 70B reasoning model at 1,600 tokens/second.[54]

In February 2025, Mistral AI began using Cerebras products to power its Le Chat service and reached a speed record.[55]

Also in February 2025, Cerebras announced a partnership with Perplexity AI's Sonar model, which runs on Cerebras chips at 1,200 tokens per second.[56]

In March 2025, Cerebras announced plans to construct six new datacenters in Dallas, Minneapolis, Oklahoma City, Montreal, New York, and France, increasing inference capacity twentyfold to over 40 million tokens per second, expecting increased demand from Llama 4 and DeepSeek.[57] Meta Platforms agreed to use Cerebras products to power the Llama API in April 2025.[58]

In April 2025, Cerebras and Ranovus announced a contract from DARPA.[59]

In May 2025, Cerebras beat NVIDIA's Blackwell in Llama 4 Inference with more than 2,500 tokens per second/user, compared to 1,000 for Blackwell, on the 400B-parameter Llama 4 Maverick model in testing by an independent benchmarking firm.[60]

In July 2025, Cerebras unveiled Qwen3-235B, an ultra-fast deployment of Alibaba Group's open-weight Qwen AI models, with full 131k context support on its inference cloud platform.[61]

In January 2026, Cerebras signed a deal with OpenAI to deliver 750 megawatts of computing power through 2028 for $10 billion.[62] As part of the agreement with OpenAI, the company is temporarily prohibited from selling its products to Anthropic.[5]

In March 2026, Amazon Web Services agreed to purchase CS-3 systems to be used for its Trainium-powered servers set to be deployed on Amazon Bedrock in its data centers.[63]

Financing

[edit]

In May 2016, Cerebras raised $27 million in a series A round led by Benchmark, Foundation Capital and Eclipse Ventures.[64][2]

In December 2016, the company raised a series B round led by Coatue Management. In January 2017, it raised a series C round led by VY Capital.[2]

In November 2018, Cerebras raised $88 million in a series D round, making the company a unicorn. Investors in this round included Altimeter, VY Capital, Coatue, Foundation Capital, Benchmark, and Eclipse.[65][66]

In November 2019, Cerebras raised $270 million in a series E round at a valuation of $2.4 billion.[67]

In November 2021, Cerebras raised $250 million in a Series F round, valuing the company at over $4 billion and bringing its total financing to $720 million. The Series F financing round was led by Alpha Wave Ventures and Abu Dhabi Growth Fund (ADG).[68]

In July, August, and September 2024, the company raised $85 million in a series F-1 round, selling shares for $14.66 each.[1]

In September 2025, the company raised $1.1 billion in a Series G round, selling shares for $36.23 each, or a valuation of $8.1 billion.[69][1]

In January 2026, the company raised $1.0 billion in a Series H round, selling shares for $89.01 each. In this round, Benchmark invested $225 million, while Alpha Wave and funds managed by Fidelity Investments each invested $100 million.[1]

On May 14, 2026, the company became a public company via the largest U.S. technology initial public offering since 2019, raising $5.55 billion by selling 30 million shares at $185 each and opening on Nasdaq at $350, valuing the company at $95 billion.[70] CBRS stock surged by 68% on its first day of trading.[71]

See also

[edit]

References

[edit]
  1. ^ a b c d e f g h "Registration Statement (Form S-1)". United States Securities and Exchange Commission. May 11, 2026.
  2. ^ a b c Tilley, Aaron (August 31, 2017). "AI Chip Boom: This Stealthy AI Hardware Startup Is Worth Almost A Billion". Forbes.
  3. ^ Haddad, CJ (May 14, 2026). "Cerebras IPO mints two billionaires, sets stage for potential AI wave". CNBC.
  4. ^ Bort, Julie (May 14, 2026). "Cerebras IPO makes billions for Benchmark but VC Eric Vishria almost didn't take the meeting". TechCrunch.
  5. ^ a b c Bort, Julie (May 16, 2026). "$60B AI chip darling Cerebras almost died early on, burning $8M a month". TechCrunch.
  6. ^ a b Metz, Cade (August 19, 2019). "To Power A.I., Start-Up Creates a Giant Computer Chip". The New York Times. ISSN 0362-4331.
  7. ^ a b Crichton, Danny (November 19, 2019). "The Cerebras CS-1 computes deep learning AI problems by being bigger, bigger, and bigger than any other chip". TechCrunch.
  8. ^ a b Moore, Samuel K. (April 20, 2021). "Cerebras' New Monster AI Chip Adds 1.4 Trillion Transistors". IEEE Spectrum.
  9. ^ Takahashi, Dean (November 19, 2019). "Cerebras Systems deploys the 'world's fastest AI computer' at Argonne National Lab". VentureBeat.
  10. ^ Moore, Samuel K. (January 1, 2020). "Cerebras's Giant Chip Will Smash Deep Learning's Speed Barrier". IEEE Spectrum.
  11. ^ Ray, Tiernan (September 5, 2020). "Glaxo's biology research with novel Cerebras machine shows hardware may change how AI is done". ZDNet.
  12. ^ a b Moss, Sebastian (April 21, 2021). "Cerebras debuts new 2.6 trillion transistor wafer scale chip for AI". Data Center Dynamics.
  13. ^ Shah, Agam (May 6, 2020). "National Lab Taps AI Machine With Massive Chip to Fight Coronavirus". The Wall Street Journal. ISSN 0099-9660.
  14. ^ "Cerebras Systems and NETL Set New Compute Milestone". HPCwire. November 18, 2020.
  15. ^ Takahashi, Dean (August 19, 2020). "Cerebras puts 'world's largest computer chip' in Lassen supercomputer". VentureBeat.
  16. ^ Hemsoth, Nicole (March 30, 2021). "Neocortex Supercomputer to Put Cerebras CS-1 to the Test". The Next Platform.
  17. ^ Swinhoe, Dan (February 5, 2021). "EPCC chooses Cerebras' massive chip for new supercomputer". Data Center Dynamics.
  18. ^ "Cerebras Systems Expands Global Footprint with New Offices in Tokyo, Japan, and Toronto, Canada" (Press release). Business Wire. September 22, 2020.
  19. ^ a b c Takahashi, Dean (April 20, 2021). "Cerebras launches new AI supercomputing processor with 2.6 trillion transistors". VentureBeat.
  20. ^ a b Ray, Tiernan (April 20, 2021). "Cerebras continues 'absolute domination' of high-end compute, it says, with world's hugest chip two-dot-oh". ZDNet.
  21. ^ Knight, Will (August 24, 2021). "A New Chip Cluster Will Make Massive AI Models Possible". Wired. ISSN 1059-1028.
  22. ^ Khalili, Joel (August 25, 2021). "The world's largest chip is creating AI networks larger than the human brain". TechRadar.
  23. ^ Moore, Samuel K. (August 24, 2021). "Cerebras' Tech Trains "Brain-Scale" AIs". IEEE Spectrum.
  24. ^ Butler, Georgia (March 8, 2022). "Cerebras Systems CS-2 to be used by TotalEnergies for research". Data Center Dynamics.
  25. ^ Martin, Dylan (March 14, 2022). "Cerebras brings CS-2 system to data analysis biz nference". The Register.
  26. ^ "NCSA Deploys Cerebras CS-2 in New HOLL-I Supercomputer for Large-Scale AI". HPCwire. May 31, 2022.
  27. ^ Moss, Sebastian (May 26, 2022). "Leibniz Supercomputing Centre to deploy HPE-Cerebras supercomputer". Data Center Dynamics.
  28. ^ "Peptilogics and Cerebras Systems Partner on AI Solutions to Advance Peptide Therapeutics". HPCwire. August 18, 2021.
  29. ^ a b Pires, Francisco (June 22, 2022). "Cerebras Slays GPUs, Breaks Record for Largest AI Models Trained on a Single Device". Tom's Hardware.
  30. ^ a b Takahashi, Dean (June 22, 2022). "Cerebras Systems sets record for largest AI models ever trained on one device". VentureBeat.
  31. ^ Ray, Tiernan (August 3, 2022). "AI startup Cerebras celebrated for chip triumph where others tried and failed". ZDNet.
  32. ^ Spicer, Dag (August 3, 2022). "The Biggest Chip In the World". Computer History Museum.
  33. ^ "Cerebras Systems Continues Global Expansion with New India Office". EE Times. August 31, 2022.
  34. ^ a b Shah, Agam (September 14, 2022). "Cerebras Proposes AI Megacluster with Billions of AI Compute Cores". HPCwire.
  35. ^ Freund, Karl (September 14, 2022). "New Cerebras Wafer-Scale Cluster Eliminates Months Of Painstaking Work To Build Massive Intelligence". Forbes.
  36. ^ Mann, Tobias (October 18, 2022). "DoE to trial Cerebras AI compute in nuclear weapon sims". The Register.
  37. ^ a b Alcorn, Paul (November 14, 2022). "Cerebras Reveals Andromeda, a 13.5 Million Core AI Supercomputer". Tom's Hardware.
  38. ^ Lee, Jane Lanhee (November 14, 2022). "Silicon Valley chip startup Cerebras unveils AI supercomputer". Reuters.
  39. ^ Ray, Tiernan (November 29, 2022). "AI challenger Cerebras unveils 'pay-per-model' AI cloud service with Cirrascale, Jasper". ZDNet.
  40. ^ "Cerebras and National Energy Tech Lab Set New Milestones for High-Performance, Energy-Efficient Field Equation Modeling Using Simple Python Interface". HPCwire. November 10, 2022.
  41. ^ Jackson, Kevin (November 29, 2022). "Argonne researchers win Gordon Bell Special Prize for adapting language models to track virus variants" (Press release). Argonne National Laboratory.
  42. ^ Nellis, Stephen; Hu, Krystal (July 20, 2023). "Cerebras Systems signs $100 mln AI supercomputer deal with UAE's G42". Reuters.
  43. ^ Lu, Yiwen (July 20, 2023). "An A.I. Supercomputer Whirs to Life, Powered by Giant Computer Chips". The New York Times.
  44. ^ Moore, Samuel K. (July 20, 2023). "Cerebras Introduces Its 2-Exaflop AI Supercomputer". IEEE Spectrum.
  45. ^ "Cerebras and G42 Unveil World's Largest Supercomputer for AI Training with 4 exaFLOPs to Fuel a New Era of Innovation" (Press release). PR Newswire. July 21, 2023.
  46. ^ Takahashi, Dean (March 13, 2024). "Cerebras breaks ground on Condor Galaxy 3, an AI supercomputer that can hit 8 exaFLOPs". VentureBeat.
  47. ^ Cherney, Max A. (August 30, 2023). "UAE's G42 launches open source Arabic language AI model". Reuters.
  48. ^ Booth, Harry (October 30, 2024). "Cerebras Systems Wafer-Scale Engine 3: 200 Best Inventions of 2024". Time.
  49. ^ Genkina, Dina (June 12, 2024). "Giant Chips Give Supercomputers a Run for Their Money". IEEE Spectrum.
  50. ^ Steinberg, Don (May 30, 2024). "TIME100 Most Influential Companies 2024: Cerebras Systems". Time.
  51. ^ Chang, Ollie (June 20, 2024). "Cerebras partners with Dell to challenge Nvidia's AI dominance". DigiTimes. Taipei.
  52. ^ Ray, Tiernan (August 27, 2024). "AI startup Cerebras debuts 'world's fastest inference' service - with a twist". ZDNET.
  53. ^ Eadline, Doug (January 16, 2025). "Cerebras and Mayo Clinic Unveil Advanced Genomic AI Model". HPCwire.
  54. ^ Athow, Desire (January 30, 2025). "DeepSeek now runs on largest AI chip ever to deliver out-of-this world inference performance and I can't wait to try it". TechRadar.
  55. ^ Nellis, Stephen (February 6, 2025). "AI chip firm Cerebras partners with France's Mistral, claims speed record". Reuters.
  56. ^ Nuñez, Michael (February 11, 2025). "Cerebras-Perplexity deal targets $100B search market with ultra-fast AI". VentureBeat.
  57. ^ Nuñez, Michael (March 11, 2025). "Cerebras just announced 6 new AI datacenters that process 40M tokens per second — and it could be bad news for Nvidia". VentureBeat.
  58. ^ Nuñez, Michael (April 29, 2025). "Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second". VentureBeat.
  59. ^ Trueman, Charlotte (April 2, 2025). "Cerebras and Ranovus receive $45m DARPA contract to "solve" compute bottleneck". Data Center Dynamics.
  60. ^ "Cerebras Beats NVIDIA Blackwell in Llama 4 Maverick Inference" (Press release). Business Wire. May 28, 2025.
  61. ^ Shittu, Esther (July 8, 2025). "Cerebras launches Alibaba model, forms key AI partnerships". Informa TechTarget.
  62. ^ Rooney, Jordan; Novet, Kate (January 14, 2026). "Cerebras scores OpenAI deal worth over $10 billion ahead of AI chipmaker's IPO". CNBC.
  63. ^ Trueman, Charlotte (March 13, 2026). "AWS partners with big chip co. Cerebras for AI "inference disaggregation"". Data Center Dynamics.
  64. ^ Lynley, Matthew (December 19, 2016). "A stealthy startup called Cerebras raised around $25 million to build deep learning hardware". TechCrunch.
  65. ^ Martin, Dylan (November 27, 2019). "AI Chip Startup Cerebras Reveals 'World's Fastest AI Supercomputer'". CRN.
  66. ^ "Cerebras Unveils AI Supercomputer-On-A-Chip". Forbes. August 19, 2019.
  67. ^ Foxton, Sally Ward- (April 20, 2021). "Cerebras Crams More Compute Into Second-Gen 'Dinner Plate Sized' Chip". EE Times.
  68. ^ Nellis, Stephen (November 10, 2021). Macfie, Nick (ed.). "AI chip startup Cerebras Systems raises $250 million in funding". Reuters.
  69. ^ Novet, Jordan (September 30, 2025). "AI chip company Cerebras raises $1 billion in pre-IPO funding round". CNBC.
  70. ^ Wang, Echo (May 13, 2026). "Cerebras prices IPO at $185 per share to raise $5.55 billion". Reuters.
  71. ^ https://www.cnbc.com/2026/05/14/cerebras-cbrs-stock-trade-nasdaq-ipo.html
[edit]