Cerebras

Enterprise

company

Verified

https://www.cerebras.net/

CerebrasSystems

Cerebras

AI & ML interests

None defined yet.

Recent Activity

daniel-cerebras updated a Space 4 months ago

cerebras/chain-of-thought

rohand updated a dataset 5 months ago

cerebras/HybridDialogue

rohand updated a dataset 5 months ago

cerebras/TAT-QA-Arithmetic-CoT

View all activity

cerebras's activity

daniel-cerebras

updated a Space 4 months ago

Chain Of Thought

rohand

updated 2 datasets 5 months ago

cerebras/HybridDialogue

Viewer • Updated Aug 19, 2024 • 19.9k • 53 • 2

cerebras/TAT-QA-Arithmetic-CoT

Viewer • Updated Aug 19, 2024 • 8.33k • 51 • 4

rohand

updated 3 models 5 months ago

cerebras/Dragon-DocChat-Context-Encoder

Updated Aug 16, 2024 • 9 • 2

cerebras/Dragon-DocChat-Query-Encoder

Updated Aug 16, 2024 • 5 • 1

cerebras/Llama3-DocChat-1.0-8B

Text Generation • Updated Aug 16, 2024 • 135 • 67

qanthony

authored a paper 5 months ago

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

abhaygupta

authored a paper 5 months ago

DAiSEE: Towards User Engagement Recognition in the Wild

Paper • 1609.01885 • Published Sep 7, 2016

qanthony

authored 4 papers 7 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 32

Zyda: A 1.3T Dataset for Open Language Modeling

Paper • 2406.01981 • Published Jun 4, 2024 • 3

Comparative Study of Large Language Model Architectures on Frontier

Paper • 2402.00691 • Published Feb 1, 2024

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 49

vithursant

authored 3 papers 7 months ago

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Paper • 2403.00952 • Published Mar 1, 2024

Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation

Paper • 2104.09648 • Published Apr 19, 2021

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

abhaygupta

authored 4 papers 8 months ago

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Paper • 2303.10464 • Published Mar 18, 2023 • 1

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Paper • 2303.11525 • Published Mar 21, 2023 • 1

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

YX-Cerebras

updated a model 8 months ago

cerebras/Cerebras-GPT-Intermediate

Text Generation • Updated Apr 23, 2024