Tom Hunn

thunnai

thunn

AI & ML interests

I've been working in generative AI since it was still called "Deep Learning." Most of my experience is in the audio space, but I've also worked on projects involving llms. I've had hands-on experience across the full AI product lifecycle, including: - AI Research & Model Training - MLOps & Productionizing Models - Product Strategy — exploring what’s possible with emerging tech

Recent Activity

Reacted to MoritzLaurer's post with 🔥 3 days ago

I've been building a small library for working with prompt templates on the HF hub: `pip install prompt-templates`. Motivation: The community currently shares prompt templates in a wide variety of formats: in datasets, in model cards, as strings in .py files, as .txt/.yaml/.json/.jinja2 files etc. This makes sharing and working with prompt templates unnecessarily complicated. Prompt templates are currently the main hyperparameter that people tune when building complex LLM systems or agents. If we don't have a common standard for sharing them, we cannot systematically test and improve our systems. After comparing different community approaches, I think that working with modular .yaml or .json files is the best approach. The `prompt-templates` library : - proposes a standard for sharing prompts (entirely locally or on the HF hub) - provides some utilities that are interoperable with the broader ecosystem Try it: ```py # !pip install prompt-templates from prompt_templates import PromptTemplateLoader prompt_template = PromptTemplateLoader.from_hub(repo_id="MoritzLaurer/closed_system_prompts", filename="claude-3-5-artifacts-leak-210624.yaml") ``` The library is in early stages, feedback is welcome! More details in the docs: https://github.com/MoritzLaurer/prompt_templates/

replied to freddyaboulton's post 4 days ago

Just created a cookbook of real time audio/video spaces created using Gradio and WebRTC ⚡️ Use this and the [docs](https://freddyaboulton.github.io/gradio-webrtc/) to get started building the next gen of AI apps! https://huggingface.co/collections/freddyaboulton/gradio-webrtc-cookbook-6758ba7745aeca7b1be7de0f

Reacted to freddyaboulton's post with 🚀 4 days ago

View all activity

Organizations

None yet

thunnai's activity

reacted to MoritzLaurer's post with 🔥 3 days ago

Post

1091

I've been building a small library for working with prompt templates on the HF hub: pip install prompt-templates. Motivation:

The community currently shares prompt templates in a wide variety of formats: in datasets, in model cards, as strings in .py files, as .txt/.yaml/.json/.jinja2 files etc. This makes sharing and working with prompt templates unnecessarily complicated.

Prompt templates are currently the main hyperparameter that people tune when building complex LLM systems or agents. If we don't have a common standard for sharing them, we cannot systematically test and improve our systems. After comparing different community approaches, I think that working with modular .yaml or .json files is the best approach.

The prompt-templates library :
- proposes a standard for sharing prompts (entirely locally or on the HF hub)
- provides some utilities that are interoperable with the broader ecosystem

Try it:

# !pip install prompt-templates
from prompt_templates import PromptTemplateLoader 
prompt_template = PromptTemplateLoader.from_hub(repo_id="MoritzLaurer/closed_system_prompts", filename="claude-3-5-artifacts-leak-210624.yaml")

The library is in early stages, feedback is welcome!

More details in the docs: https://github.com/MoritzLaurer/prompt_templates/

replied to freddyaboulton's post 4 days ago

Awesome work on this! Another big win for rapid prototyping with Gradio!

reacted to freddyaboulton's post with 🚀🔥 4 days ago

Post

1005

Just created a cookbook of real time audio/video spaces created using Gradio and WebRTC ⚡️

Use this and the [docs](https://freddyaboulton.github.io/gradio-webrtc/) to get started building the next gen of AI apps!

freddyaboulton/gradio-webrtc-cookbook-6758ba7745aeca7b1be7de0f

2 replies

liked a Space 6 days ago

Running on Zero

🏵️

StableDelight

liked a model 6 days ago

Datou1111/shou_xin

Text-to-Image • Updated 7 days ago • 15.3k • • 436

liked a model 10 days ago

facebook/mbart-large-50-many-to-many-mmt

Translation • Updated Sep 28, 2023 • 241k • 310

liked a Space 10 days ago

Running on Zero

🎵

MelodyFlow

liked a model 12 days ago

oxyapi/oxy-1-small

Text Generation • Updated 12 days ago • 1.12k • 70

liked 2 models 14 days ago

rain1011/pyramid-flow-miniflux

Text-to-Video • Updated Nov 13 • 148

OuteAI/OuteTTS-0.2-500M

Text-to-Speech • Updated 13 days ago • 18k • 268

liked a model 18 days ago

InstantX/FLUX.1-dev-IP-Adapter

Text-to-Image • Updated 23 days ago • 8.89k • 210

liked a model 20 days ago

google/byt5-large

Text2Text Generation • Updated Jan 24, 2023 • 128k • 12

liked a dataset 21 days ago

HuggingFaceTB/smoltalk

Viewer • Updated 20 days ago • 2.2M • 9.87k • 248

reacted to TuringsSolutions's post with 👀 23 days ago

Post

849

I created something called 'Hyperbolic Embeddings'. I literally just embed the tokens into Hyperbolic Space instead of Euclidean space. At first, this did not get me the gains I was expecting. I was a sad panda. Then I thought about it, a Hyperbolic Embedding needs a Hyperbolic Optimizer. So, instead of Adam, I used Riemannian Adam (RAdam). "Ladies and Gentlemen, We Got 'Em!"

27 replies

liked a model 23 days ago

ali-vilab/In-Context-LoRA

Text-to-Image • Updated 29 days ago • 171k • • 481

reacted to cdminix's post with 🔥 25 days ago

Post

982

As part of some ongoing work, I'm releasing the currently biggest collection of docker containers for state-of-the-art voice cloning TTS systems.
https://github.com/ttsds/datasets

Alongside there is also a nice overview of all systems (see below)

liked 3 datasets 27 days ago