Luke Neumann's picture
2

Luke Neumann PRO

LukeNeumann

AI & ML interests

Fine tuning training data

Recent Activity

Organizations

Overlai.ai's profile picture

LukeNeumann's activity

posted an update 26 days ago
view post
Post
1031
I had a question about Trending datasets. Our initial dataset "Oregon Coast in 4K" was trending at #3 for video at about 700 downloads.

Over the past two days our downloads have spiked, now up to over 2,000, but the dataset has dropped down to the 3rd or 4th page of Trending.

What metrics are used to determine dataset Trending position?
  • 1 reply
ยท
posted an update about 1 month ago
view post
Post
558
Something I've been thinking about regarding fine-tuning video models.

How well are these models trained on the fundamentals of image capture? Take shutter speed/angle, for instance.

The difference in a video with a 1/2000 speed shutter and a 1/60 speed shutter is drastic.

What about color science? HDR vs. SDR, Anamorphic vs. spherical, sensor size, aspect ratio.

Is it worth doing an open-source dataset "film school series"?

It may seem too granular but I wonder if this level of granularity makes for better downstream results.
  • 1 reply
ยท
reacted to thomwolf's post with ๐Ÿคฏ about 1 month ago
replied to their post about 1 month ago
view reply

It would be helpful to know what kind of dataset to build! We have a lot of content.

replied to their post about 1 month ago
view reply

Do you think you would train with the full 8K?? What subject matter do you think would be best for the initial set?

posted an update about 1 month ago
view post
Post
1218
Nine years ago, I uploaded the first 8K resolution video to YouTube and I've been stockpiling 8K footage ever since: https://www.youtube.com/watch?v=sLprVF6d7Ug&t

Should @Overlaiapp release the first open-source 8K video dataset?

Could anyone even fine tune a model with this?๐Ÿ˜…
ยท
posted an update about 1 month ago
view post
Post
1858
Hello Hugging Face community!

I wanted to introduce myself and my company @Overlaiapp . We are a collective of filmmakers, photographers, and AI engineers working on high resolution (8K+) training data.

We plan to share a lot of our datasets with the community and are kicking things off with two curated datasets:

- Overlaiai/OregonCoastin4K

- Overlaiai/SubArcticPolarBear


Overlai.ai Dataset Features

๐ŸŽฅ Oversampled: Every clip is captured in stunning 8K resolution, delivering rich detail ideal for fine tuning scenic landscapes and ocean dynamics.

๐Ÿ“ธ Variance: Includes close-up details, slow-motion footage of crashing waves, sweeping landscapes, and wildlife shots.

๐Ÿ“‹ Detailed Metadata: Every clip is paired with structured metadata, including creative descriptions, precise camera movements, lens information, field of view calculations, and shot settings, ensuring AI models can fully understand and replicate real-world cinematography with accuracy.

โš™๏ธ Consistency: Re-thinking training data at the point of capture by "overshooting" a subject, enabling models to learn more nuanced relationships and views across scenes.

๐ŸŒ… Light: Shot during early morning and sunset light for optimal color contrast and dynamic range, maximizing visual quality for color and lighting-sensitive tasks.

๐Ÿ” Curation: Curated specifically for machine learning, providing clean, high-quality data for next generation model training.