David Quispe PRO

daqc

AI & ML interests

Education

Recent Activity

liked a dataset 1 day ago

Llamacha/monolingual-quechua-iic

liked a dataset 1 day ago

pollitoconpapass/cuzco-quechua-translation-spanish

liked a dataset 1 day ago

Zeal-Nir/quechua-audioset-4GB

View all activity

Organizations

daqc's activity

liked 7 datasets 1 day ago

upvoted a paper 1 day ago

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 19 days ago • 26

liked a dataset 1 day ago

HuggingFaceFW/fineweb-2

Viewer • Updated 26 days ago • 13.8B • 105k • 379

liked a model 1 day ago

yulan-team/YuLan-Mini

Text Generation • Updated about 6 hours ago • 566 • 26

upvoted a collection 1 day ago

HuatuoGPT-o1

Collection

4 items • Updated 4 days ago • 6

reacted to prithivMLmods's post with 🤗 1 day ago

Post

2721

Triangulum Catalogued 🔥💫

🎯Triangulum is a collection of pretrained and instruction-tuned generative models, designed for multilingual applications. These models are trained using synthetic datasets based on long chains of thought, enabling them to perform complex reasoning tasks effectively.

+ Triangulum-10B : prithivMLmods/Triangulum-10B
+ Quants : prithivMLmods/Triangulum-10B-GGUF

+ Triangulum-5B : prithivMLmods/Triangulum-5B
+ Quants : prithivMLmods/Triangulum-5B-GGUF

+ Triangulum-1B : prithivMLmods/Triangulum-1B
+ Quants : prithivMLmods/Triangulum-1B-GGUF

1 reply

reacted to roseking's post with 🚀 1 day ago

Post

2353

🤗 Hugging Face Download Tool

The Hugging Face Download Tool is a sophisticated graphical user interface application designed to simplify the process of downloading resources from Hugging Face repositories. This tool addresses common challenges in model and file downloads through its intelligent features and user-friendly interface.

✨ Key Features
- 🖥️ Intuitive graphical interface for easy operation
- 🔄 Advanced retry mechanism with smart error handling
- ⏸️ Resume capability for interrupted downloads
- 📊 Real-time download status monitoring
- 🔐 Secure access to private repositories via token authentication

🛠️ Technical Highlights
The tool implements several advanced features to ensure reliable downloads:
- 📦 Chunk-based downloading with 1MB segments
- ⚡ Adaptive retry intervals (5-300 seconds) based on error types
- 🔌 Connection pooling for optimized performance
- 🛡️ Built-in rate limiting protection
- 🔑 Secure token handling for private repository access

This tool is ideal for researchers, developers, and AI practitioners who regularly work with Hugging Face resources and need a reliable, user-friendly download solution. 💻 It supports all major operating systems and requires minimal setup, making it accessible to users of all technical levels. 🚀

GitHub：https://github.com/2404589803/hf_downloader

2 replies

updated a model 3 days ago

daqc/SmolLM2-FT-ORPO-Medicina-es

Text Generation • Updated 3 days ago • 7

liked 2 Spaces 3 days ago

Runtime error

💬

Discussion Forum

Running

335

🧬

Synthetic Data Generator

Build datasets using natural language

updated 2 datasets 4 days ago

daqc/medicina-qa-binarized-dpo-orpo-es

Viewer • Updated 4 days ago • 10k • 2

daqc/medicina-qa-dpo-orpo-format-es

Viewer • Updated 4 days ago • 10k • 2

updated a model 4 days ago

daqc/SmolLM2-FT-DPO-Medicina_es

Text Generation • Updated 4 days ago • 8

reacted to lewtun's post with 👀 4 days ago

Post

6624

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!

2 replies