Large Language Model - Cerebras https://www.cerebras.net/category/large-language-model/ Wed, 15 May 2024 18:32:19 +0000 en-US hourly 1 https://wordpress.org/?v=6.3.2 https://www.cerebras.net/wp-content/uploads/2022/05/cropped-cerebras-logo-fav-32x32.png Large Language Model - Cerebras https://www.cerebras.net/category/large-language-model/ 32 32 Introducing Sparse Llama: 70% Smaller, 3x Faster, Full Accuracy https://www.cerebras.net/blog/introducing-sparse-llama-70-smaller-3x-faster-full-accuracy Wed, 15 May 2024 13:00:52 +0000 https://www.cerebras.net/?p=105462

The post Introducing Sparse Llama: 70% Smaller, 3x Faster, Full Accuracy appeared first on Cerebras.

]]>
Cerebras CS-3 vs. Nvidia B200: 2024 AI Accelerators Compared https://www.cerebras.net/blog/cerebras-cs-3-vs-nvidia-b200-2024-ai-accelerators-compared Fri, 12 Apr 2024 20:35:32 +0000 https://www.cerebras.net/?p=105391

The post Cerebras CS-3 vs. Nvidia B200: 2024 AI Accelerators Compared appeared first on Cerebras.

]]>
Cerebras CS-3: the world’s fastest and most scalable AI accelerator https://www.cerebras.net/blog/cerebras-cs3 Tue, 12 Mar 2024 23:52:34 +0000 https://www.cerebras.net/?p=105336

The post Cerebras CS-3: the world’s fastest and most scalable AI accelerator appeared first on Cerebras.

]]>
Cerebras and Qualcomm Unleash ~10X Inference Performance Boost with Hardware-Aware LLM Training https://www.cerebras.net/blog/cerebras-qualcomm-10x-inference-aware-training Mon, 11 Mar 2024 21:24:25 +0000 https://www.cerebras.net/?p=105329

The post Cerebras and Qualcomm Unleash ~10X Inference Performance Boost with Hardware-Aware LLM Training appeared first on Cerebras.

]]>
Key Insights from the 1st Multilingual Workshop https://www.cerebras.net/blog/key-insights-from-the-1st-multilingual-workshop Tue, 06 Feb 2024 03:01:55 +0000 https://www.cerebras.net/?p=105195

The post Key Insights from the 1st Multilingual Workshop appeared first on Cerebras.

]]>
Sparsity Made Easy – Introducing the Cerebras PyTorch Sparsity Library https://www.cerebras.net/blog/sparsity-made-easy-introducing-the-cerebras-pytorch-sparsity-library Mon, 05 Feb 2024 22:03:41 +0000 https://www.cerebras.net/?p=105180

The post Sparsity Made Easy – Introducing the Cerebras PyTorch Sparsity Library appeared first on Cerebras.

]]>
Cerebras announces support for NSF’s NAIRR Pilot, aimed at advancing our nation’s leadership in AI computing and research https://www.cerebras.net/blog/cerebras-announces-support-for-nsfs-nairr-pilot-aimed-at-advancing-our-nations-leadership-in-ai-computing-and-research Wed, 24 Jan 2024 16:05:39 +0000 https://www.cerebras.net/?p=105173

The post Cerebras announces support for NSF’s NAIRR Pilot, aimed at advancing our nation’s leadership in AI computing and research appeared first on Cerebras.

]]>
Introducing gigaGPT: GPT-3 sized models in 565 lines of code https://www.cerebras.net/blog/introducing-gigagpt-gpt-3-sized-models-in-565-lines-of-code Fri, 29 Dec 2023 23:27:08 +0000 https://www.cerebras.net/?p=105105

The post Introducing gigaGPT: GPT-3 sized models in 565 lines of code appeared first on Cerebras.

]]>
Fine-Tuning Language Models Using Direct Preference Optimization https://www.cerebras.net/blog/fine-tuning-language-models-using-direct-preference-optimization Fri, 29 Dec 2023 22:47:12 +0000 https://www.cerebras.net/?p=105099

The post Fine-Tuning Language Models Using Direct Preference Optimization appeared first on Cerebras.

]]>
Cerebras 2024 Predictions for Generative AI, LLMs, and HPC https://www.cerebras.net/blog/cerebras-2024-predictions-for-generative-ai-llms-and-hpc Fri, 29 Dec 2023 22:38:00 +0000 https://www.cerebras.net/?p=105096

The post Cerebras 2024 Predictions for Generative AI, LLMs, and HPC appeared first on Cerebras.

]]>
Five Reasons to Join Cerebras in 2024 https://www.cerebras.net/blog/5-reasons-to-join-cerebras Fri, 08 Dec 2023 23:07:13 +0000 https://www.cerebras.net/?p=105154

The post Five Reasons to Join Cerebras in 2024 appeared first on Cerebras.

]]>
Cerebras Pioneers Ethical AI Development through Collaborative AI Initiatives  https://www.cerebras.net/blog/cerebras-drives-collaborative-ai-initiatives-for-an-ethically-grounded-future Tue, 05 Dec 2023 15:00:28 +0000 https://www.cerebras.net/?p=105063

The post Cerebras Pioneers Ethical AI Development through Collaborative AI Initiatives  appeared first on Cerebras.

]]>
Cerebras Software Release 2.0: 50% Faster Training, PyTorch 2.0 Support, Diffusion Transformers, and More https://www.cerebras.net/blog/cerebras-software-release-2.0-50-faster-training-pytorch-2.0-support-diffusion-transformers-and-more Fri, 10 Nov 2023 17:00:47 +0000 https://www.cerebras.net/?p=104998

The post Cerebras Software Release 2.0: 50% Faster Training, PyTorch 2.0 Support, Diffusion Transformers, and More appeared first on Cerebras.

]]>
How we fine-tuned Llama2-70B to pass the US Medical License Exam in a week https://www.cerebras.net/blog/how-we-fine-tuned-llama2-70b-to-pass-the-us-medical-license-exam-in-a-week/ Thu, 12 Oct 2023 22:16:15 +0000 https://www.cerebras.net/?p=104960

The post How we fine-tuned Llama2-70B to pass the US Medical License Exam in a week appeared first on Cerebras.

]]>
Jais: a New Pinnacle in Open Arabic NLP https://www.cerebras.net/blog/jais-a-new-pinnacle-in-open-arabic-nlp Tue, 05 Sep 2023 18:17:26 +0000 https://www.cerebras.net/?p=104915

The post Jais: a New Pinnacle in Open Arabic NLP appeared first on Cerebras.

]]>
BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model https://www.cerebras.net/blog/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/ Mon, 24 Jul 2023 18:00:35 +0000 https://www.cerebras.net/?p=104831

The post BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model appeared first on Cerebras.

]]>
Accelerating Large Language Model Training with Variable Sparse Pre-training and Dense Fine-tuning https://www.cerebras.net/blog/accelerating-llm-training-with-variable-sparse-pre-training-and-dense-fine-tuning/ Sat, 22 Jul 2023 15:10:11 +0000 https://www.cerebras.net/?p=104832

The post Accelerating Large Language Model Training with Variable Sparse Pre-training and Dense Fine-tuning appeared first on Cerebras.

]]>
Variable Sequence Length Training for Long-Context Large Language Models https://www.cerebras.net/blog/variable-sequence-length-training-for-long-context-large-language-models/ Sat, 22 Jul 2023 15:00:49 +0000 https://www.cerebras.net/?p=104833

The post Variable Sequence Length Training for Long-Context Large Language Models appeared first on Cerebras.

]]>
Introducing Condor Galaxy 1: a 4 exaFLOPS Supercomputer for Generative AI https://www.cerebras.net/blog/introducing-condor-galaxy-1-a-4-exaflop-supercomputer-for-generative-ai/ Thu, 20 Jul 2023 12:50:35 +0000 https://www.cerebras.net/?p=104797

The post Introducing Condor Galaxy 1: a 4 exaFLOPS Supercomputer for Generative AI appeared first on Cerebras.

]]>
SlimPajama: A 627B token, cleaned and deduplicated version of RedPajama https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama Fri, 09 Jun 2023 16:00:58 +0000 https://www.cerebras.net/?p=104755

The post SlimPajama: A 627B token, cleaned and deduplicated version of RedPajama appeared first on Cerebras.

]]>
Efficient Large-Scale GPT Training Using a Cerebras Wafer-Scale Cluster https://www.cerebras.net/blog/efficient-large-scale-gpt-training-using-a-cerebras-wafer-scale-cluster Wed, 24 May 2023 02:28:24 +0000 https://www.cerebras.net/?p=104728

The post Efficient Large-Scale GPT Training Using a Cerebras Wafer-Scale Cluster appeared first on Cerebras.

]]>
Fine-Tuning with Cerebras AI Model Studio Launchpad https://www.cerebras.net/blog/fine-tuning-with-cerebras-ai-model-studio Mon, 17 Apr 2023 13:00:52 +0000 https://www.cerebras.net/?p=104681

The post Fine-Tuning with Cerebras AI Model Studio Launchpad appeared first on Cerebras.

]]>