Blog, Articles, and discussions

How to deploy and fine-tune DeepSeek models on AWS

By January 30, 2025 • 45

Community Articles

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

1 day ago

• 16

Argunauts Training Phase I: Continual Pretraining on Synthetic Data

•

1 day ago

Best AI Setups for Multi-Agent Workflows in KaibanJS

•

1 day ago

🌁#88: Can DeepSeek Inspire Global Collaboration?

•

2 days ago

• 3

Adapting Artificial Intelligence to Creole (Adapter l’intelligence artificielle au créole)

•

2 days ago

Open-sourcing the Plain English to SQL Pipeline

•

2 days ago

• 1

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

•

3 days ago

• 14

Nexus Shift: AI Generated Short Story

•

3 days ago

WTF is Fine-Tuning? (intro4devs) | [2025]

•

3 days ago

• 4

🦸🏻#10: Does Present-Day GenAI Actually Reason?

•

4 days ago

• 5

Blazing-Fast Code Editing via Multi-Layer Speculation

and 3 others •

5 days ago

• 15

Argunauts: Open LLMs that Master Argument Analysis with Argdown

•

5 days ago

• 1

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

and 1 other •

6 days ago

• 10

Adventures in AI

•

6 days ago

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

and 1 other •

8 days ago

• 11

Jupyter X Hugging Face

By March 23, 2023 • 2

New ViT and ALIGN Models From Kakao Brain

By March 6, 2023

Hugging Face and AWS partner to make AI more accessible

By February 21, 2023 • 2

Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 2

By February 6, 2023

Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1

By January 2, 2023 • 2

Zero-shot image segmentation with CLIPSeg

By December 21, 2022 • 6

Faster Training and Inference: Habana Gaudi®2 vs Nvidia A100 80GB

By December 14, 2022 • 2

Pre-Train BERT with Hugging Face Transformers and Habana Gaudi

By August 22, 2022 • 5

Graphcore and Hugging Face Launch New Lineup of IPU-Ready Transformers

By May 26, 2022

Getting Started with Transformers on Habana Gaudi

By April 26, 2022

Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training

By April 12, 2022

Fine-Tune a Semantic Segmentation Model with a Custom Dataset

By March 17, 2022 • 17

Accelerate BERT inference with Hugging Face Transformers and AWS inferentia

By March 16, 2022

Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker

By January 11, 2022

Community Articles

Grok 3 ai : Best AI model now!

•

about 8 hours ago

• 1

Argunauts Training Phase II: Selfplay Finetuning Line-By-Line

•

about 11 hours ago

• 2

Synthetic Face Embeddings: Research Notes and Methodology

and 1 other •

about 15 hours ago

• 1

How to use Sentient’s Dobby-70B

•

about 18 hours ago

Mahjong: Where Grandmas Beat The Best LLMs

•

about 22 hours ago
