Max (moock)

AI & ML interests

None yet

Organizations

MXMK · huggingPartyParis

moock's activity

upvoted an article 19 days ago

Use Models from the Hugging Face Hub in LM Studio

By yagilb • 127
reacted to clem's post with 🚀 6 months ago
5,000 new repos (models, datasets, spaces) are created EVERY DAY on HF now. The community is amazing!
replied to lunarflu's post 7 months ago

It would be fun to have a prediction of my future daily activities 🪄

reacted to lunarflu's post with 🔥 7 months ago
cooking up something....anyone interested in a daily activity tracker for HF?
reacted to singhsidhukuldeep's post with 👍 7 months ago
🎭 You picked an LLM for your work but then you find out it hallucinates! 🤖

🤔 Your first thought might be to fine-tune it on more training data... but should you? 🛠️

📜 This is what @Google is exploring in the paper "Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?" 🕵️‍♂️

📘 When LLMs undergo supervised fine-tuning with new factual knowledge not present in their initial training data, there is a risk they might "hallucinate" or produce factually incorrect information. 🚨

πŸ” The paper investigates how fine-tuning LLMs with new facts influences their ability to leverage pre-existing knowledge and the extent to which they generate errors. πŸ“Š

βš™οΈTechnical Setup:

🔧 Approach: They introduce a system named SliCK (short for Sampling-based Categorization of Knowledge) to categorize knowledge into four levels (HighlyKnown, MaybeKnown, WeaklyKnown, and Unknown) based on how well the model's generated responses agree with known facts; a rough sketch of the idea follows this setup. 🗂️

📏 Experimental Setup: The study uses a controlled setup focusing on closed-book QA, adjusting the proportion of fine-tuning examples that introduce new facts versus those that do not. 🧪
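
Here is a minimal sketch of what that SliCK-style categorization could look like in code. It is illustrative only, not the paper's exact procedure: the helper names (exact_match, categorize_fact) and the simple thresholds are assumptions of mine.

```python
# Rough sketch of SliCK-style knowledge categorization (illustrative only):
# bucket a fact by how often the model's greedy and temperature-sampled
# answers match the known ground-truth answer.
from typing import List

def exact_match(prediction: str, ground_truth: str) -> bool:
    # Simplified answer check; the paper uses its own matching criteria.
    return prediction.strip().lower() == ground_truth.strip().lower()

def categorize_fact(greedy_answers: List[str],
                    sampled_answers: List[str],
                    ground_truth: str) -> str:
    """Assign one of the four knowledge levels to a (question, answer) pair.

    greedy_answers:  answers decoded greedily (temperature 0) across prompts
    sampled_answers: answers decoded with temperature sampling
    """
    greedy_acc = sum(exact_match(a, ground_truth) for a in greedy_answers) / len(greedy_answers)
    sample_acc = sum(exact_match(a, ground_truth) for a in sampled_answers) / len(sampled_answers)

    if greedy_acc == 1.0:
        return "HighlyKnown"   # greedy decoding is always correct
    if greedy_acc > 0.0:
        return "MaybeKnown"    # greedy decoding is sometimes correct
    if sample_acc > 0.0:
        return "WeaklyKnown"   # only sampling ever recovers the answer
    return "Unknown"           # the model never produces the known fact
```

Facts labeled Known versus Unknown in this way are what the controlled setup then mixes into the fine-tuning data in different proportions.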

👉 Here is the gist of the findings:

🚸 LLMs struggle to integrate new factual knowledge during fine-tuning, and such examples are learned more slowly than those consistent with the model's pre-existing knowledge. 🐢

📈 As LLMs learn from examples containing new knowledge, their propensity to hallucinate increases. 👻

⏱️ Early stopping during training can mitigate the risk of hallucinations by minimizing exposure to unlearned new facts (see the sketch after these findings). 🛑

🧠 Training LLMs mostly with known examples leads to better utilization of pre-existing knowledge, whereas examples introducing new knowledge increase the risk of generating incorrect information. 🏗️
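
As a toy illustration of the early-stopping point above (my own sketch, not the paper's recipe; train_one_epoch and dev_accuracy are placeholder stubs you would replace with real fine-tuning and closed-book QA evaluation):

```python
# Toy early-stopping loop (illustrative only): monitor held-out closed-book QA
# accuracy each epoch and stop once it stops improving, so the model spends
# less time fitting the still-unlearned "Unknown" facts.
import random

def train_one_epoch(model, train_examples):
    """Placeholder for one supervised fine-tuning pass over the QA examples."""
    pass

def dev_accuracy(model, dev_examples) -> float:
    """Placeholder exact-match evaluation on a held-out dev set."""
    return random.random()  # stand-in score; plug in a real evaluation here

def fine_tune_with_early_stopping(model, train_examples, dev_examples,
                                  max_epochs: int = 20, patience: int = 2) -> float:
    best_acc, bad_epochs = 0.0, 0
    for _ in range(max_epochs):
        train_one_epoch(model, train_examples)
        acc = dev_accuracy(model, dev_examples)
        if acc > best_acc:
            best_acc, bad_epochs = acc, 0   # in practice, checkpoint the weights here
        else:
            bad_epochs += 1
            if bad_epochs >= patience:      # stop before over-fitting the new facts
                break
    return best_acc
```

The stopping signal here is just held-out accuracy with a small patience; the paper's observation is simply that stopping early limits how many unlearned facts the model ever fits.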

📄 Paper: Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? (2405.05904) 📚
New activity in yanze/PuLID 7 months ago
liked a Space 8 months ago