Article: BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ • By xhluca • 3 days ago • 22 upvotes
Paper: Instruction Pre-Training: Language Models are Supervised Multitask Learners • arXiv:2406.14491 • Published 9 days ago • 74 upvotes
Article: Introducing the Ultimate SEC LLM: Revolutionizing Financial Insights with Llama-3-70B • By Crystalcareai • 10 days ago • 6 upvotes
Article: Building a Vision Mixture-of-Expert Model from several fine-tuned Phi-3-Vision Models • By mjbuehler • 17 days ago • 4 upvotes
Article: Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval • Mar 22 • 43 upvotes
Collection: Unmixtraled experts • All 8 experts of Mixtral 8x22B converted to single dense 22B models, intended as a basis for merges or fine-tuning. • 9 items • Updated Apr 11 • 1 upvote
Collection: 💥 Laser vs DoRA vs Daser vs LoRA • A comparison of different PEFT techniques applied to NeuralMonarch. • 4 items • Updated Mar 22 • 5 upvotes
Collection: Model Merging • Model merging is a popular technique in the LLM space; here is a chronological list of papers on the topic to help you get started. • 30 items • Updated 16 days ago • 188 upvotes
Paper: DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models • arXiv:2402.03300 • Published Feb 5 • 66 upvotes
Collection: 🐶 Beagle • Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 8 items • Updated May 27 • 6 upvotes
Collection: DRAGON Models • Production-grade, RAG-optimized 6-7B parameter models: "Delivering RAG on ..." the leading foundation base models. • 11 items • Updated Feb 3 • 42 upvotes