view post Post 195 Reply The most followed orgs on Hugging Face 🤗Some organizations that deserve more 🧡- https://huggingface.co/mistralai- https://huggingface.co/Qwen- https://huggingface.co/deepseek-ai
view post Post 573 Reply You can clean and format datasets entirely in the browser with a few lines of SQL. In this post, I replicate the process @mlabonne used to clean the new microsoft/orca-agentinstruct-1M-v1 dataset. The cleaning process consists of:- Joining the separate splits together / add split column- Converting string messages into list of structs- Removing empty system promptshttps://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-datasetHere's his new cleaned dataset: mlabonne/orca-agentinstruct-1M-v1-cleaned
MLC WebLLM Running 25 ⚡ Phi-3.5-Mini WebLLM Running 16 ⚡ Qwen-2.5 WebLLM Running 122 🏎️ WebLLM Playground Running 134 🐍 Qwen 2.5 Code Interpreter
NaturalFunctions LLMs fine tuned for function calling 🤖 cfahlgren1/natural-functions Text Generation • Updated Jan 26 • 34 • 46 cfahlgren1/natural-functions-GGUF Updated Jan 29 • 91 • 17