SDK Archives - Cerebras

Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster

Rebecca Lewington — Fri, 07 Apr 2023 17:24:10 +0000

We introduce Cerebras-GPT, a family of open compute-optimal language models scaled from 111M to 13B parameters. We train Cerebras-GPT models on the Eleuther Pile dataset following DeepMind Chinchilla scaling rules for efficient pre-training (highest accuracy for a given compute budget).…

The post Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster appeared first on Cerebras.

Training Giant Neural Networks Using Weight Streaming on Cerebras Wafer-Scale Systems

Rebecca Lewington — Fri, 24 Mar 2023 18:00:44 +0000

In this paper, we survey existing approaches used to scale training to clusters of compute units and explore the limitations of each in the face of giant models. We present a new paradigm for giant model training, called weight streaming,…

The post Training Giant Neural Networks Using Weight Streaming on Cerebras Wafer-Scale Systems appeared first on Cerebras.

Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Rebecca Lewington — Wed, 22 Mar 2023 16:53:24 +0000

Replacing dense layers with Sparse-IFT leads to significant improvements across computer vision (CV) and natural language processing (NLP) tasks, including ResNet-18 on ImageNet (+3.5%) and GPT-3 Small on WikiText-103 (-0.4 PPL), both matching larger dense model variants with 2x or…

The post Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency appeared first on Cerebras.

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Rebecca Lewington — Tue, 21 Mar 2023 16:41:45 +0000

Presented at the ICLR 2023 Workshop on Sparsity in Neural Networks.
In this work, we show the benefits of using unstructured weight sparsity to train only a subset of weights during pre-training (Sparse Pre-training) and then recover the representational capacity…

The post SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models appeared first on Cerebras.

Wafer-Scale Fast Fourier Transforms

Tin Hoang — Fri, 20 Jan 2023 19:33:42 +0000

We have implemented fast Fourier transforms for one, two, and three-dimensional arrays on the Cerebras CS-2, a system whose memory and processing elements reside on a single silicon wafer. The wafer-scale engine (WSE) encompasses a two-dimensional mesh of roughly 850,000…

The post Wafer-Scale Fast Fourier Transforms appeared first on Cerebras.

GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics

Tin Hoang — Thu, 24 Nov 2022 04:58:55 +0000

Our work seeks to transform how new and emergent variants of pandemic causing viruses, specially SARS-CoV-2, are identified and classified. By adapting large language models (LLMs) for genomic data, we build genome-scale language models (GenSLMs) which can learn the evolutionary…

The post GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics appeared first on Cerebras.

Disruptive Changes in Field Equation Modeling: A Simple Interface for Wafer Scale Engines

Tin Hoang — Thu, 29 Sep 2022 03:47:31 +0000

We present a high-level and accessible Application Programming Interface (API) for the solution of field equations on the Cerebras Systems Wafer-Scale Engine (WSE) with over two orders of magnitude performance gain relative to traditional distributed computing approaches. The domain-specific API…

The post Disruptive Changes in Field Equation Modeling: A Simple Interface for Wafer Scale Engines appeared first on Cerebras.

TensorFlow as a DSL for stencil-based computation on the Cerebras Wafer-Scale Engine

Rebecca Lewington — Fri, 26 Aug 2022 17:10:18 +0000

The Cerebras Wafer Scale Engine (WSE) is an accelerator that combines hundreds of thousands of AI-cores onto a single chip. Whilst this technology has been designed for machine learning workloads, the significant amount of available raw compute means that it…

The post TensorFlow as a DSL for stencil-based computation on the Cerebras Wafer-Scale Engine appeared first on Cerebras.

NETL Researchers Work to Unlock Potential of Artificial Intelligence in Climate Modeling

Tin Hoang — Tue, 19 Jul 2022 18:57:48 +0000

esearchers at the U.S. National Energy Technology Laboratory (NETL) are helping the National Center for Atmospheric Research (NCAR) unlock the potential of an advanced artificial intelligence (AI) computing resource to perform critical climate modeling that could lead to better climate…

The post NETL Researchers Work to Unlock Potential of Artificial Intelligence in Climate Modeling appeared first on Cerebras.

Meet the nominees for the 2022 VentureBeat Women in AI Awards!

Tin Hoang — Fri, 08 Jul 2022 21:34:18 +0000

Two Cerebras Systems engineers are finalists for VentureBeat’s Women in AI awards!…

The post Meet the nominees for the 2022 VentureBeat Women in AI Awards! appeared first on Cerebras.

Age Checks, Theft Prevention, Minecraft AI, Autism, Responsible AI

Tin Hoang — Wed, 06 Jul 2022 17:50:21 +0000

…

The post Age Checks, Theft Prevention, Minecraft AI, Autism, Responsible AI appeared first on Cerebras.

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Tin Hoang — Tue, 28 Jun 2022 21:38:33 +0000

This work introduces the RevSilo, the first reversible module for bidirectional multi-scale feature fusion. Like other reversible methods, RevSilo eliminates the need to store hidden activations by recomputing them. Existing reversible methods, however, do not apply to multi-scale feature fusion…

The post RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network appeared first on Cerebras.

Cerebras trains 20 billion parameter AI model on a single system, sets new record

Tin Hoang — Mon, 27 Jun 2022 22:01:37 +0000

…

The post Cerebras trains 20 billion parameter AI model on a single system, sets new record appeared first on Cerebras.

Training a 20–Billion Parameter AI Model on a Single Processor

Liz Wu — Fri, 24 Jun 2022 22:12:32 +0000

Cerebras has shown off the capabilities of its second–generation wafer–scale engine, announcing it has set the record for the largest AI model ever trained on a single device.
For the first time, a natural language processing network with 20 billion…

The post Training a 20–Billion Parameter AI Model on a Single Processor appeared first on Cerebras.

Cerebras breaks record for largest AI models trained on a single device

Tin Hoang — Thu, 23 Jun 2022 23:12:47 +0000

The post Cerebras breaks record for largest AI models trained on a single device appeared first on Cerebras.

Why The Cerebras CS-2 Machine is a Big Deal

Liz Wu — Thu, 23 Jun 2022 22:06:07 +0000

…

The post Why The Cerebras CS-2 Machine is a Big Deal appeared first on Cerebras.

Cerebras Slays GPUs, Breaks Record for Largest AI Models Trained on a Single Device

Tin Hoang — Wed, 22 Jun 2022 14:25:41 +0000

Democratizing large AI Models without HPC scaling requirements.…

The post Cerebras Slays GPUs, Breaks Record for Largest AI Models Trained on a Single Device appeared first on Cerebras.

Cerebras Systems Thinks Forward on AI Chips as it Claims Performance Win

Liz Wu — Wed, 22 Jun 2022 14:21:45 +0000

…

The post Cerebras Systems Thinks Forward on AI Chips as it Claims Performance Win appeared first on Cerebras.

Cerebras Systems sets record for largest AI models ever trained on one device

Liz Wu — Wed, 22 Jun 2022 14:19:16 +0000

…

The post Cerebras Systems sets record for largest AI models ever trained on one device appeared first on Cerebras.

Cerebras just built a big chip that could democratize AI

Liz Wu — Wed, 22 Jun 2022 14:11:28 +0000

Chip startup Cerebras has developed a foot-wide piece of silicon, compared to average chips measured in millimeters, that makes training AI cheap, and easy.…

The post Cerebras just built a big chip that could democratize AI appeared first on Cerebras.

#77 – VITALIY CHILEY (Cerebras)

Tin Hoang — Thu, 16 Jun 2022 23:58:41 +0000

…

The post #77 – VITALIY CHILEY (Cerebras) appeared first on Cerebras.

Eine neue Maschine für KI und HPC

Tin Hoang — Thu, 02 Jun 2022 18:42:32 +0000

…

The post Eine neue Maschine für KI und HPC appeared first on Cerebras.

NCSA Deploys Cerebras CS-2 in New HOLL-I Supercomputer for Large-Scale AI

Tin Hoang — Wed, 01 Jun 2022 00:19:48 +0000

…

The post NCSA Deploys Cerebras CS-2 in New HOLL-I Supercomputer for Large-Scale AI appeared first on Cerebras.

Leading Supercomputer Sites Choose Cerebras for AI Acceleration

Tin Hoang — Wed, 01 Jun 2022 00:12:23 +0000

…

The post Leading Supercomputer Sites Choose Cerebras for AI Acceleration appeared first on Cerebras.

LRZ Adds Mega AI System as It Stacks up on Future Computing Systems

Rebecca Lewington — Fri, 27 May 2022 04:35:12 +0000

…

The post LRZ Adds Mega AI System as It Stacks up on Future Computing Systems appeared first on Cerebras.

HPE, Cerebras build AI supercomputer for scientific research

Rebecca Lewington — Fri, 27 May 2022 04:31:27 +0000

…

The post HPE, Cerebras build AI supercomputer for scientific research appeared first on Cerebras.

HPE is building a rapid AI supercomputer powered by the world’s largest CPU

Tin Hoang — Fri, 27 May 2022 00:27:46 +0000

…

The post HPE is building a rapid AI supercomputer powered by the world’s largest CPU appeared first on Cerebras.

Bio-IT World Judges, Community Honor Six Outstanding New Products

Rebecca Lewington — Fri, 06 May 2022 00:20:42 +0000

…

The post Bio-IT World Judges, Community Honor Six Outstanding New Products appeared first on Cerebras.

Argonne Talks AI Accelerators for COVID Research

Rebecca Lewington — Fri, 29 Apr 2022 00:03:11 +0000

…

The post Argonne Talks AI Accelerators for COVID Research appeared first on Cerebras.

Cerebras Systems’ dinner plate-sized chips are revolutionizing the field of AI

Rebecca Lewington — Thu, 28 Apr 2022 23:59:47 +0000

When the chip is the size of a big pizza pie… that’s Cerebras…

The post Cerebras Systems’ dinner plate-sized chips are revolutionizing the field of AI appeared first on Cerebras.

Accelerating insights in large scale AI projects

Rebecca Lewington — Mon, 25 Apr 2022 23:38:27 +0000

…

The post Accelerating insights in large scale AI projects appeared first on Cerebras.

A Templated C++ Interface for ISL

Rebecca Lewington — Sat, 23 Apr 2022 04:24:02 +0000

Polyhedral libraries typically support only a very limited collection of types for representing objects, corresponding to broad mathematical classes such as sets, binary relations and functions.…

The post A Templated C++ Interface for ISL appeared first on Cerebras.

Cerebras, TotalEnergies Announce Stencil Algorithm Leap

Rebecca Lewington — Fri, 22 Apr 2022 16:38:27 +0000

…

The post Cerebras, TotalEnergies Announce Stencil Algorithm Leap appeared first on Cerebras.

How healthcare and pharmaceutical research will accelerate through AI

Rebecca Lewington — Wed, 20 Apr 2022 22:28:48 +0000

…

The post How healthcare and pharmaceutical research will accelerate through AI appeared first on Cerebras.

Cerebras Expands Support for Pytorch and Tensorflow Machine Learning Frameworks on the Wafer-Scale Engine 2 Processors that Power Its CS-2 System

Rebecca Lewington — Wed, 20 Apr 2022 22:26:50 +0000

Deep learning has emerged as our generation’s most critical computing job. Tasks that were once the unique realm of humans are now regularly executed at human or superhuman levels by computers.…

The post Cerebras Expands Support for Pytorch and Tensorflow Machine Learning Frameworks on the Wafer-Scale Engine 2 Processors that Power Its CS-2 System appeared first on Cerebras.

Accelerating Discovery: Andrew Feldman, Co-Founder and CEO, Cerebras Systems

Rebecca Lewington — Wed, 20 Apr 2022 21:51:11 +0000

New hardware can substantially increase the speed and efficiency of deep neural network training. To guide the development of future hardware architectures, it is pertinent to explore the hardware and machine learning properties of alternative training algorithms.…

The post Accelerating Discovery: Andrew Feldman, Co-Founder and CEO, Cerebras Systems appeared first on Cerebras.

The World’s Largest Chip Just Received A Major Machine Learning-Flavored Upgrade

Tin Hoang — Fri, 15 Apr 2022 19:03:02 +0000

…

The post The World’s Largest Chip Just Received A Major Machine Learning-Flavored Upgrade appeared first on Cerebras.

PSC UPGRADES NEOCORTEX AI SUPERCOMPUTER WITH NEW CEREBRAS ENGINES

Rebecca Lewington — Thu, 14 Apr 2022 22:35:43 +0000

If you were going to build an electronic brain in 2022, it might look something like the Neocortex supercomputer at the Pittsburgh Supercomputing Center at Carnegie Mellon University. That machine, which was only installed last year, has now got a…

The post PSC UPGRADES NEOCORTEX AI SUPERCOMPUTER WITH NEW CEREBRAS ENGINES appeared first on Cerebras.

Massively scalable stencil algorithm

Tin Hoang — Thu, 07 Apr 2022 17:54:07 +0000

Stencil computations lie at the heart of many scientific and industrial applications. Unfortunately, stencil algorithms perform poorly on machines with cache based memory hierarchy, due to low reuse of memory accesses. This work shows that for stencil computation a novel…

The post Massively scalable stencil algorithm appeared first on Cerebras.

Powering Extreme-Scale HPC with Cerebras WaferScale Accelerators

Rebecca Lewington — Wed, 06 Apr 2022 23:54:21 +0000

In this paper, we will explore the challenges facing HPC developers today and show how the Cerebras architecture can help to accelerate sparse linear algebra and tensor workloads, stencilbased partial differential equation (PDE) solvers, N-body problems, and spectral algorithms such…

The post Powering Extreme-Scale HPC with Cerebras WaferScale Accelerators appeared first on Cerebras.

The Cerebras Software Development Kit: A Technical Overview

Rebecca Lewington — Tue, 08 Feb 2022 01:06:52 +0000

Cerebras has introduced a new software development kit (SDK) which allows anyone to take advantage of the strengths of the CS-2 system. Developers can use the Cerebras SDK to create custom kernels for their standalone applications or modify the kernel…

The post The Cerebras Software Development Kit: A Technical Overview appeared first on Cerebras.

Epigenomic language models powered by Cerebras

Rebecca Lewington — Thu, 27 Jan 2022 04:25:10 +0000

Large scale self-supervised pre-training of Transformer language models has advanced the field of Natural Language Processing and shown promise in cross-application to the biological `languages’ of proteins and DNA. Learning effective representations of DNA sequences using large genomic sequence corpuses…

The post Epigenomic language models powered by Cerebras appeared first on Cerebras.

Intelligent Resolution: Integrating Cryo-EM with AI-driven Multi-resolution Simulations to Observe the SARS-CoV-2 Replication-Transcription Machinery in Action

Rebecca Lewington — Thu, 18 Nov 2021 05:25:03 +0000

The severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) replication transcription complex (RTC) is a multi-domain protein responsible for replicating and transcribing the viral mRNA inside a human cell. Attacking RTC function with pharmaceutical com- pounds is a pathway to treating COVID-19.…

The post Intelligent Resolution: Integrating Cryo-EM with AI-driven Multi-resolution Simulations to Observe the SARS-CoV-2 Replication-Transcription Machinery in Action appeared first on Cerebras.

Cerebras – Eye On AI

Rebecca Lewington — Mon, 18 Oct 2021 14:52:08 +0000

Andrew Feldman, one of the founders and CEO of Cerebras Systems, talks about the company’s wafer-scale computer chip optimized for machine learning and about the network of chips that company has built that has as much computing power as a…

The post Cerebras – Eye On AI appeared first on Cerebras.

Cerebras Systems Enables Brain-scale AI

Rebecca Lewington — Wed, 22 Sep 2021 00:12:54 +0000

This research paper explores Cerebras System’s approach to create a brain-scale AI and the new technologies that could enable that feat. But first, let’s put this discussion into the proper context. Just how big is a 120 trillion-parameter model?…

The post Cerebras Systems Enables Brain-scale AI appeared first on Cerebras.

Stream-AI-MD: streaming AI-driven adaptive molecular simulations for heterogeneous computing platforms

Rebecca Lewington — Tue, 06 Jul 2021 04:02:23 +0000

Emerging hardware tailored for artificial intelligence (AI) and machine learning (ML) methods provide novel means to couple them with traditional high performance computing (HPC) workflows involving molecular dynamics (MD) simulations. We propose Stream-AI-MD, a novel instance of applying deep learning…

The post Stream-AI-MD: streaming AI-driven adaptive molecular simulations for heterogeneous computing platforms appeared first on Cerebras.

Limits to Scale-Out for Training Language Models

Rebecca Lewington — Fri, 25 Jun 2021 00:23:58 +0000

Natural language processing has revolutionized how data is consumed, meaning that computational demand has skyrocketed. Companies in every industry are using graphics processing unit (GPU) clusters to keep up. But is this really the best solution?…

The post Limits to Scale-Out for Training Language Models appeared first on Cerebras.

Train Large BERT Models Faster with Cerebras Systems

Rebecca Lewington — Tue, 25 May 2021 00:39:22 +0000

Unstructured text is one of the largest human-generated data sources. Web data, academic publications, emails, traditional media, texts, instant messages, digital records, social media — all hold an enormous volume of unstructured text.…

The post Train Large BERT Models Faster with Cerebras Systems appeared first on Cerebras.

Cerebras Systems: Achieving Industry Best AI Performance Through A Systems Approach

Rebecca Lewington — Wed, 07 Apr 2021 00:51:34 +0000

The CS-2 is a system solution that consists of innovations across three dimensions: a) the second
generation Cerebras Wafer Scale Engine (WSE-2) — the industry’s largest and only multi-trilliontransistor processor, b) the Cerebras System and c) the Cerebras software platform.…

The post Cerebras Systems: Achieving Industry Best AI Performance Through A Systems Approach appeared first on Cerebras.

Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation

Rebecca Lewington — Sat, 06 Mar 2021 04:40:10 +0000

We propose combining memory saving techniques with traditional U-Net architectures to increase the complexity of the models on the Brain Tumor Segmentation (BraTS) challenge. The BraTS challenge consists of a 3D segmentation of a 240 240 155 4 input image…

The post Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation appeared first on Cerebras.

Pipelined Backpropagation at Scale: Training Large Models without Batches

Rebecca Lewington — Mon, 01 Mar 2021 18:29:17 +0000

The post Pipelined Backpropagation at Scale: Training Large Models without Batches appeared first on Cerebras.

System Integration of Neocortex, a Unique, Scalable AI Platform

Rebecca Lewington — Thu, 04 Feb 2021 05:21:34 +0000

The Pittsburgh Supercomputing Center, in partnership with Cerebras Systems and Hewlett Packard Enterprise, has deployed Neocortex, an innovative computing platform that accelerates scientific discovery by vastly shortening the time required for deep learning training and fosters greater integration of deep…

The post System Integration of Neocortex, a Unique, Scalable AI Platform appeared first on Cerebras.

EPCC Selects Cerebras Systems AI Supercomputer to Rapidly Accelerate AI Research

Rebecca Lewington — Thu, 04 Feb 2021 01:22:55 +0000

…

The post EPCC Selects Cerebras Systems AI Supercomputer to Rapidly Accelerate AI Research appeared first on Cerebras.

Fast Stencil-Code Computation on a Wafer-Scale Processor

Rebecca Lewington — Fri, 23 Oct 2020 04:00:21 +0000

The performance of CPU-based and GPU-based systems is often low for PDE codes, where large, sparse, and often structured systems of linear equations must be solved. Iterative solvers are limited by data movement, both between caches and memory and between…

The post Fast Stencil-Code Computation on a Wafer-Scale Processor appeared first on Cerebras.

Fast Stencil-Code Computation on a Wafer-Scale Processor

Tin Hoang — Wed, 07 Oct 2020 14:55:41 +0000

The post Fast Stencil-Code Computation on a Wafer-Scale Processor appeared first on Cerebras.

The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain

Rebecca Lewington — Wed, 08 Jul 2020 03:57:12 +0000

In this essay, we explore a point of intersection between deep learning and neuroscience, through the lens of large language models, transfer learning and network compression.…

The post The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain appeared first on Cerebras.

Generating SIMD Instructions for Cerebras CS-1 using Polyhedral Compilation Techniques

Rebecca Lewington — Sun, 23 Feb 2020 05:03:52 +0000

The Cerebras CS-1 is a computing system based on a waferscale processor having nearly 400,000 compute cores. It is intended for training of and inference on deep neural networks.…

The post Generating SIMD Instructions for Cerebras CS-1 using Polyhedral Compilation Techniques appeared first on Cerebras.

Online Normalization for Training Neural Networks

Rebecca Lewington — Fri, 29 Nov 2019 23:15:48 +0000

Polyhedral libraries typically support only a very limited collection of types for representing objects, corresponding to broad mathematical classes such as sets, binary relations and functions.…

The post Online Normalization for Training Neural Networks appeared first on Cerebras.

Online Normalization for Training Neural Networks, NeurIPS 2019

Rebecca Lewington — Thu, 16 May 2019 03:48:47 +0000

Online Normalization is a new technique for normalizing the hidden activations of a neural network. Like Batch Normalization, it normalizes the sample dimension. While Online Normalization does not use batches, it is as accurate as Batch Normalization.…

The post Online Normalization for Training Neural Networks, NeurIPS 2019 appeared first on Cerebras.