Matt Valoatto PRO

mvaloatto

AI & ML interests

Image classification, image feature extraction, text classification, design, art, tech, science. πŸ€— since 2016.

Recent Activity

Organizations

AI FILMS's profile picture lora concepts library's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture huggingPartyParis's profile picture Spaces Playground's profile picture Social Post Explorers's profile picture Top Contributors: Space Likes's profile picture Top Contributors: Dataset Downloads's profile picture Top Contributors: Model Downloads's profile picture Top Contributors: Profile Followers's profile picture Tamis AI's profile picture Hugging Face Discord Community's profile picture

mvaloatto's activity

New activity in mvaloatto/TCTF 8 months ago
reacted to victor's post with πŸ”₯ 8 months ago
view post
Post
4285
The hype is real: a mysterious gpt2-chatbot model has appeared on the LLM Arena Leaderboard πŸ‘€.
It seems to be at least on par with the top performing models (closed and open).

To try it out: https://chat.lmsys.org/ -> then click on the Direct Chat tab and select gpt2-chatbot.

Take your bet, what do you think it is?
Β·
liked a Space 8 months ago
reacted to clem's post with πŸ€— 9 months ago
view post
Post
2535
Introducing gretelai/synthetic_text_to_sql by https://huggingface.co/gretelai

It stands as the largest and most diverse synthetic Text-to-SQL dataset available to-date.

The dataset includes:

- 105,851 records partitioned into 100,000 train and 5,851 test records
~23M total tokens, including ~12M SQL tokens
- Coverage across 100 distinct domains/verticals
- Comprehensive array of SQL tasks: data definition, retrieval, manipulation, analytics & reporting
- Wide range of SQL complexity levels, including subqueries, single joins, multiple joins, aggregations, window functions, set operations
- Database context, including table and view create statements
- Natural language explanations of what the SQL query is doing
- Contextual tags to optimize model training

Blogpost: https://gretel.ai/blog/synthetic-text-to-sql-dataset
Dataset: gretelai/synthetic_text_to_sql
  • 1 reply
Β·