PatrickHaller/ngme-llama-264M
Tags: Text Generation, Transformers, PyTorch, allenai/c4, English, ngme, Inference Endpoints
NGME-LLama 264M
Trained on 4 A6000 GPUs for ~4 days
Trained on ~4.9 billion tokens (4 * 16 * 768 * 100_000 = 4,915,200,000)
Training data: the C4 corpus (allenai/c4)
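
The token count in parentheses can be checked with a few lines of Python. The model card gives only the four factors; the variable names below (number of GPUs, per-device batch size, sequence length, training steps) are an assumed reading of those factors, not something the card states. The product works out to roughly 4.9 billion tokens.

```python
# Factors copied from the model card: 4 * 16 * 768 * 100_000.
# The meaning attached to each factor is an ASSUMPTION for illustration;
# the card itself does not label them.
num_gpus = 4              # assumed: matches "4 A6000"
per_device_batch = 16     # assumed: sequences per GPU per step
sequence_length = 768     # assumed: tokens per sequence
training_steps = 100_000  # assumed: number of optimizer steps

tokens_per_step = num_gpus * per_device_batch * sequence_length
total_tokens = tokens_per_step * training_steps

print(f"tokens per step: {tokens_per_step:,}")    # 49,152
print(f"total tokens:    {total_tokens:,}")       # 4,915,200,000
print(f"approx:          {total_tokens / 1e9:.1f}B")  # 4.9B
```

Under this reading, each step processes 49,152 tokens, so 100,000 steps cover about 4.9 billion tokens.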
Dataset used to train PatrickHaller/ngme-llama-264M: allenai/c4