Optimum Internal Testing

https://github.com/huggingface/optimum

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

dacorvo updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-gpt2-5f6d7f29da

dacorvo published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-gpt2-5f6d7f29da

optimum-internal-testing-user updated a model 1 day ago

optimum-internal-testing/tiny_random_bert_neuronx

View all activity

optimum-internal-testing's activity

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-gpt2-5f6d7f29da

Updated 1 day ago • 64

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-gpt2-5f6d7f29da

Updated 1 day ago • 64

optimum-internal-testing-user

updated a model 1 day ago

optimum-internal-testing/tiny_random_bert_neuronx

Feature Extraction • Updated 1 day ago • 1.99k

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-granite-5f6d7f29da

Updated 1 day ago • 64

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-granite-5f6d7f29da

Updated 1 day ago • 64

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-qwen2-5f6d7f29da

Updated 1 day ago • 64

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-qwen2-5f6d7f29da

Updated 1 day ago • 64

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-mistral-5f6d7f29da

Updated 1 day ago • 64

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-mistral-5f6d7f29da

Updated 1 day ago • 64

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-llama-5f6d7f29da

Updated 1 day ago • 64

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-llama-5f6d7f29da

Updated 1 day ago • 64

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-granite-sha-77df188-neuron

Updated 1 day ago • 16

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-granite-sha-77df188-neuron

Updated 1 day ago • 16

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-qwen2-sha-77df188-neuron

Updated 1 day ago • 16

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-qwen2-sha-77df188-neuron

Updated 1 day ago • 16

dacorvo

updated a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-mistral-sha-77df188-neuron

Updated 1 day ago • 16

dacorvo

published a model 1 day ago

optimum-internal-testing/neuron-tgi-testing-mistral-sha-77df188-neuron

Updated 1 day ago • 16

sayakpaul

posted an update 5 days ago

Post

2749

Inference-time scaling meets Flux.1-Dev (and others) 🔥

Presenting a simple re-implementation of "Inference-time scaling diffusion models beyond denoising steps" by Ma et al.

I did the simplest random search strategy, but results can potentially be improved with better-guided search methods.

Supports Gemini 2 Flash & Qwen2.5 as verifiers for "LLMGrading" 🤗

The steps are simple:

For each round:

1> Starting by sampling 2 starting noises with different seeds.
2> Score the generations w.r.t a metric.
3> Obtain the best generation from the current round.

If you have more compute budget, go to the next search round. Scale the noise pool (2 ** search_round) and repeat 1 - 3.

This constitutes the random search method as done in the paper by Google DeepMind.

Code, more results, and a bunch of other stuff are in the repository. Check it out here: https://github.com/sayakpaul/tt-scale-flux/ 🤗

regisss

posted an update 8 days ago

Post

1600

Nice paper comparing the fp8 inference efficiency of Nvidia H100 and Intel Gaudi2: An Investigation of FP8 Across Accelerators for LLM Inference (2502.01070)

The conclusion is interesting: "Our findings highlight that the Gaudi 2, by leveraging FP8, achieves higher throughput-to-power efficiency during LLM inference"

One aspect of AI hardware accelerators that is often overlooked is how they consume less energy than GPUs. It's nice to see researchers starting carrying out experiments to measure this!

Gaudi3 results soon...

sayakpaul

posted an update 24 days ago

Post

1963

We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.

We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:

* Models and datasets: https://huggingface.co/finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py

1 reply

AI & ML interests

Recent Activity

Team members 11

optimum-internal-testing's activity