55 2 13

nyuuzyou PRO

nyuuzyou

https://ducks.party/donate

AI & ML interests

None yet

Recent Activity

posted an update about 1 hour ago

🎨 Artfol Dataset - https://huggingface.co/datasets/nyuuzyou/artfol A collection of 1,892,816 artwork posts featuring: - High-quality art pieces with various styles and techniques - Complete metadata including artist IDs, titles, and moderation flags - Content from Artfol social media platform The dataset contains: - Public domain artwork posts - Artist attribution and identifiers - Direct image URLs and web page links - Content safety flags (NSFW, gore) - Post titles and descriptions All content is available under CC0 license, allowing unrestricted use including commercial applications.

updated a collection about 3 hours ago

Anime & Art

updated a dataset about 3 hours ago

nyuuzyou/artfol

View all activity

Organizations

nyuuzyou's activity

posted an update about 1 hour ago

Post

🎨 Artfol Dataset - nyuuzyou/artfol

A collection of 1,892,816 artwork posts featuring:
- High-quality art pieces with various styles and techniques
- Complete metadata including artist IDs, titles, and moderation flags
- Content from Artfol social media platform

The dataset contains:
- Public domain artwork posts
- Artist attribution and identifiers
- Direct image URLs and web page links
- Content safety flags (NSFW, gore)
- Post titles and descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.

posted an update 8 days ago

Post

1465

🗂️ I don't think the collections feature of Hugging Face is widely used, even though it's an excellent way to organize and discover interesting resources. To do my bit to change that, I've created two carefully curated collections that combine both my original work and other valuable datasets:

Educational Datasets
- Mostly English-Russian, but other languages are also included
- Extended by my new Begemot.ai dataset (2.7M+ Russian education records) nyuuzyou/begemot

Link: nyuuzyou/educational-datasets-677c268978ac1cec96cc3605

Anime & Art

- Extensive art-focused collection, including my new datasets:
- Buzzly.art (2K artworks) nyuuzyou/buzzlyart
- Paintberri (60K+ pieces) nyuuzyou/paintberri
- Itaku.ee (924K+ items) nyuuzyou/itaku
- Extended with other amazing datasets from the community

Link: nyuuzyou/anime-and-art-677ae996682a389fccd892c3

Collections should become a more common feature - hopefully this will encourage others to create and share their own curated collections. By organizing related datasets into these themed collections, I hope to make it easier for researchers and developers to discover and use these valuable resources.

1 reply

reacted to nroggendorff's post with ➕ 10 days ago

Post

1692

Why do we only get to post once every 24 hours? I've been waiting *so long*. Anyway, now that the wait is finally over, I have some very important information to share.

1 reply

posted an update 19 days ago

Post

581

🎮 ALLSTAR.GG Dataset - nyuuzyou/allstar

A collection of 47,896 gaming clips featuring:
- High-quality gameplay captures with various clip lengths and resolutions
- Complete metadata including user IDs, clip titles, and game parameters
- Content captured from Counter-Strike 2 competitive matches
- Full game statistics and technical parameters

posted an update 20 days ago

Post

2214

🎨 KLING AI Dataset - nyuuzyou/klingai

A collection of 12,782 AI-generated media items featuring:
- High-quality image and video generations at various resolutions
- Complete metadata including user IDs, prompts, and generation parameters
- Content generated using text-to-image, text-to-video, and image-to-video modalities
- Full generation settings and technical parameters

reacted to ginipick's post with 🚀 21 days ago

Post

5145

🎬 Revolutionize Your Video Creation
Dokdo Multimodal AI Transform a single image into a stunning video with perfect audio harmony! 🚀

Superior Technology 💫
Advanced Flow Matching: Smoother video transitions surpassing Kling and Sora
Intelligent Sound System: Automatically generates perfect audio by analyzing video mood
Multimodal Framework: Advanced AI integrating image, text, and audio analysis
Outstanding Performance 🎯
Ultra-High Resolution: 4K video quality with bfloat16 acceleration
Real-Time Optimization: 3x faster processing with PyTorch GPU acceleration
Smart Sound Matching: Real-time audio effects based on scene transitions and motion
Exceptional Features ✨
Custom Audio Creation: Natural soundtrack matching video tempo and rhythm
Intelligent Watermarking: Adaptive watermark adjusting to video characteristics
Multilingual Support: Precise translation engine powered by Helsinki-NLP
Versatile Applications 🌟
Social Media Marketing: Create engaging shorts for Instagram and YouTube
Product Promotion: Dynamic promotional videos highlighting product features
Educational Content: Interactive learning materials with enhanced engagement
Portfolio Enhancement: Professional-grade videos showcasing your work
Experience the video revolution with Dokdo Multimodal, where anyone can create professional-quality content from a single image. Elevate your content with perfectly synchronized video and audio that captivates your audience! 🎨

Start creating stunning videos that stand out from the crowd - whether you're a marketer, educator, content creator, or business owner. Join the future of AI-powered video creation today!

ginipick/Dokdo-multimodal

#VideoInnovation #AITechnology #PremiumContent #MarketingSolution

🔊 Please turn on your sound for the best viewing experience!

1 reply

reacted to davanstrien's post with ❤️ 21 days ago

Post

3165

🇸🇰 Hovorte po slovensky? Help build better AI for Slovak!

We only need 90 more annotations to include Slovak in the next Hugging Face FineWeb2-C dataset ( data-is-better-together/fineweb-c) release!

Your contribution will help create better language models for 5+ million Slovak speakers.

Annotate here: data-is-better-together/fineweb-c.

Read more about why we're doing it: https://huggingface.co/blog/davanstrien/fineweb2-community

3 replies

posted an update 22 days ago

Post

2520

CS2 Highlights Video Dataset - nyuuzyou/cs2-highlights

A collection of 4,857 high-quality Counter-Strike 2 gameplay highlights featuring:

- Professional and competitive gameplay recordings at 1080p resolution
- Complete metadata including Steam IDs and clip titles
- Preview thumbnails for all videos
- Both 60 FPS (842 clips) and 120 FPS (4,015 clips) content
- Gameplay from Faceit and official competitive modes

This extensive highlights collection provides a valuable resource for developing and evaluating video-based AI applications, especially in esports and competitive gaming contexts. Released under Creative Commons Zero (CC0) license.

posted an update 25 days ago

Post

1321

🎮 GoodGame.ru Clips Dataset - nyuuzyou/goodgame

A collection of 39,280 video clips metadata from GoodGame.ru streaming platform featuring:

- Complete clip information including direct video URLs and thumbnails
- Streamer details like usernames and avatars
- Engagement metrics such as view counts
- Game categories and content classifications
- Released under Creative Commons Zero (CC0) license

This extensive clips collection provides a valuable resource for developing and evaluating video-based AI applications, especially in Russian gaming and streaming contexts.

reacted to nroggendorff's post with 😔 26 days ago

Post

3676

im so tired

3 replies

reacted to etemiz's post with 👀 29 days ago

Post

2315

As more synthetic datasets are made, we move slowly away from human alignment.

4 replies

replied to their post about 1 month ago

Yes, I don't want to pollute my subscribers' feeds (I've already had several people unsubscribe from me due to spam with reports).

Thanks for your work. Let me know if there is anything I can do to reduce your workload with my reports.

posted an update about 1 month ago

Post

835

🎓 Soloby.ru Russian Q&A Dataset - nyuuzyou/soloby

A collection of 744,131 educational question-answer pairs featuring:

- Complete Q&A content from the Soloby.ru educational platform
- Rich metadata including timestamps, authors, and categories
- Detailed question titles and corresponding answers
- Native Russian language content across various subjects
- Released under Creative Commons Zero (CC0) license

This extensive Q&A collection provides a valuable resource for developing and evaluating Russian language AI applications, especially in educational contexts. The structured format and diverse subject coverage make it ideal for training models to understand and generate Russian educational content.

2 replies

reacted to jwlben11's post with 🤗 about 1 month ago

Post

2144

What is the use of hugginface? How can I get up to speed on ML and AI and how to use this platform? Would be nice if there was a get started here section.

1 reply

reacted to cutechicken's post with 🔥👀 about 1 month ago

Post

3493

🎮 Introduction to the World's First 3D Tank Game Created Solely with Generative AI 🚀
The advancement of AI technology is revolutionizing game development paradigms. I embarked on a challenge to create a 3D tank game using "only AI assistance," pushing the boundaries of what's possible in AI-driven game development. 🤖
Following the success of my first 2D tank game ( cutechicken/tankwar) 🎯, I ventured into the more challenging realm of 3D FPS game development. Remarkably, using Hugging Face's AI tool ( VIDraft/mouse1), the basic game framework was generated in just one minute ⚡. The 3D modeling ( ginipick/SORA-3D) and sound effects ( fantaxy/Sound-AI-SFX) were also easily created with AI assistance.
The resulting game ( cutechicken/TankWar3D) represents arguably the world's first 3D FPS game created primarily with generative AI. 90% was accomplished through AI capabilities, with the remaining 10% comprising my post-processing work. 🎉
Key Technical Features: 🛠️

Complete 3D rendering system using Three.js 🖥️
Real-time physics-based collision detection and handling 💥
Dynamic shadow and lighting system ☀️
Real-time radar and enemy tracking system 🎯
Advanced particle effects system (explosions, smoke, fire) 💫
Dynamic sound system (engine, firing, explosion sounds) 🔊
AI-driven enemy strategy system (pursuit, evasion, combat) 🤖
Terrain-based tank tilt adjustment 🌍
Real-time crosshair targeting system 🎯
Dynamic UI system (health bars, ammo, score) 📊

Technical Implementation: ⚙️

Physics Engine: 🎳
Custom collision detection system
Dynamic obstacle handling
Real-time terrain interaction

AI Systems: 🧠
State-based AI behavior patterns
Dynamic pathfinding
Tactical decision-making system

Graphics: 🎨
PBR-based rendering
Dynamic particle system
Real-time shadow mapping

reacted to prithivMLmods's post with 🔥 about 1 month ago

Post

2689

strangerzonehf/Flux-Sketch-Flat-LoRA

reacted to csabakecskemeti's post with 👍 about 1 month ago

Post

4509

The AMD Instinct MI50 (~$110) is surprisingly fast for inference Quantized models.

This runs a Llama 3.1 8B Q8 with Llama.cpp
https://huggingface.co/spaces/DevQuasar/Mi50

A little blogpost about the HW
http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/

reacted to DualityAI-RebekahBogdanoff's post with 🔥 about 1 month ago

Post

2804

Training YOLO with Synthetic Data from Duality AI's Falcon🎮📊
Hi Huggingface community! 👋
Duality.ai is excited to share a Google Colab notebook that demonstrates how easy it is to train YOLOv8 using synthetic data generated in our Falcon simulation software—and see it work in the real world!

https://storage.googleapis.com/duality-public-share/syntheticDataWorks.ipynb

Train using synthetic images of a cereal box digital twin, then see it work on real-world images.🥣💛

Instructions for running this can be found here: https://falcon.duality.ai/secure/documentation/see-synth-work-no-specs
You'll have to create an account to view it, but the notebook by itself explains itself very well.

This method is a game-changer for cost-effective, scalable, and customizable datasets in computer vision.
Why Synthetic Data?🤔
- Precise Annotations: Get bounding boxes, segmentation masks, and more without manual effort.
- Customizable Scenarios: Simulate diverse conditions like lighting and weather.
What’s in the Notebook?📓
- Training & Evaluation: Train YOLOv8 with synthetic data and test its performance on real-world samples.
Try it Out! 🚀
Access the notebook here:
https://storage.googleapis.com/duality-public-share/syntheticDataWorks.ipynb
It’s fully documented and ready for you to explore and adapt.

Want to create your own custom datasets? Checkout Falcon here:
https://falcon.duality.ai/auth/sign-up
https://www.duality.ai/edu

Let’s Discuss!💬
How are you using synthetic data in ML projects? Let’s connect—drop your thoughts or questions below or on our Discord! https://discord.com/invite/dualityfalconcommunity

2 replies

reacted to julien-c's post with ❤️ about 1 month ago

Post

8423

After some heated discussion 🔥, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co/docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in Machine learning, to the benefit of the community 🔥

cc: @reach-vb @pierric @victor and the HF team

28 replies