Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker • Apr 8, 2021
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models • Paper • 2407.01906 • Published Jul 2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems • Paper • 2407.01370 • Published Jul 1