Daniel Vila

dvilasuero

AI & ML interests

RLHF, RLAIF, DPO, data, data, data

Recent Activity

Articles

Organizations

Hugging Face's profile picture Cohere For AI's profile picture SomosNLP's profile picture Libre Euro Lingua-Alliance's profile picture Hugging Face H4's profile picture Hugging Face OSS Metrics's profile picture Argilla's profile picture Blog-explorers's profile picture Hugging Face TB Research's profile picture h4-argilla-collab's profile picture ZeroGPU Explorers's profile picture mLLM multilingual's profile picture DIBT Spanish's profile picture Data is Better Together - Russian Language Team's profile picture Argilla Explorers's profile picture Open Arabic LLM Leaderboard's profile picture distilabel-internal-testing's profile picture ORPO Explorers's profile picture Data Is Better Together's profile picture Social Post Explorers's profile picture HuggingFaceFW-Dev's profile picture LLHF's profile picture UCSF-JHU Opioid Industry Documents Archive's profile picture SLLHF's profile picture Hugging Quants's profile picture argilla-internal-testing's profile picture Argilla Warehouse's profile picture rg-preview's profile picture Dataset Tools's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture

dvilasuero's activity

upvoted an article 2 days ago
view article
Article

Let’s make a generation of amazing image generation models

By burtenshaw β€’
β€’ 30
Reacted to elliesleightholm's post with πŸ”₯πŸ€— 6 days ago
upvoted an article 6 days ago
Reacted to andito's post with ❀️ 8 days ago
view post
Post
1077
Hugging face presents FineVideo πŸŽ₯! Unlocking the next generation of Video understanding πŸš€

🀯3400 hours of annotated Creative Common videos with rich character descriptions, scene splits, mood, and content descriptions per scene as well as QA pairs.
πŸ”₯
@mfarre processed over 2M videos of Youtube-CC to make this incredibly powerful selection.

Very psyched to fine-tune idefics on this dataset. ⚑️
Explore the videos: HuggingFaceFV/FineVideo-Explorer
Reacted to singhsidhukuldeep's post with πŸ‘ 8 days ago
view post
Post
1242
Sorry judge, my lawyer hallucinated? πŸ˜‚ If you get an AI lawyer, you would want it to be hallucination-free!

New @Stanford -@Yale research reveals surprising findings about leading AI legal research tools. Here's what you need to know:

>> Key Findings
The study tested LexisNexis (Lexis+ AI), Thomson Reuters (Westlaw AI & Ask Practical Law AI), and GPT-4, finding hallucination rates between 17-33% despite claims of being "hallucination-free".

>> Technical Deep Dive
The research evaluated these tools using Retrieval-Augmented Generation (RAG) architecture, which operates in two crucial steps:

1. Retrieval System:
- Uses neural text embeddings to capture semantic meaning
- Employs both lexical and semantic search mechanisms
- Implements document filtering and extraction
- Retrieves relevant legal documents from vast databases

2. Generation Pipeline:
- Processes retrieved documents alongside original queries
- Synthesizes information from multiple legal sources
- Generates responses based on retrieved context
- Includes citation verification mechanisms

>> Performance Breakdown:
- Lexis+ AI: 65% accuracy rate
- Westlaw AI: 42% accuracy rate
- Ask Practical Law AI: Over 60% incomplete answers

>> Why This Matters
This research exposes critical vulnerabilities in AI legal tools that lawyers increasingly rely on. It's essential for legal professionals to understand these limitations when incorporating AI into their practice.