Tom Aarsen

tomaarsen

AI & ML interests

NLP: text embeddings, named entity recognition, few-shot text classification

tomaarsen's activity

upvoted an article 2 days ago

BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡

By xhluca
21
upvoted 2 articles 5 days ago

Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper

By dhuynh95
4

Training and Finetuning Embedding Models with Sentence Transformers v3

103
upvoted an article 19 days ago

Introducing the Hugging Face Embedding Container for Amazon SageMaker

9
upvoted an article 21 days ago

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

12
upvoted an article 27 days ago

How to Fine-Tune Custom Embedding Models Using AutoTrain

By abhishek
10
upvoted an article 28 days ago

Benchmarking Text Generation Inference

24
upvoted 2 articles about 1 month ago

Build AI on premise with Dell Enterprise Hub

13
upvoted 2 articles about 1 month ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

152
upvoted an article about 1 month ago

Hugging Face x LangChain : A new partner package in LangChain

82
upvoted 4 articles about 2 months ago

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

By abhishek
7

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

27

Synthetic data: save money, time and carbon with open source

32
upvoted an article about 2 months ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

75
upvoted an article about 2 months ago

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

70
upvoted an article 2 months ago

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

63
upvoted an article 2 months ago

Welcome Llama 3 - Meta's new open LLM

253
upvoted 2 articles 2 months ago

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

43
upvoted an article 3 months ago

Hugging Face partners with Wiz Research to Improve AI Security

11