
Akihito Miyazaki PRO

Akjava

AI & ML interests

I'm creating a 3D/Voice Conversational AI Presentation Avatar with Diagrams

Recent Activity

Organizations

None yet

Akjava's activity

posted an update 3 days ago
Initial API-Based Smolagents and Linear.app Integration Example
Akjava/linear-app-api-smolagents
In short, this example contains a get_todo_issue() tool, plus add_comment() and change_state_reviewing() functions for linear.app.

Large language models, like 70B parameter models, can often readily utilize tools such as add_comment or change_state, potentially handling multiple issues concurrently.

However, smaller models may require repeated calls to a tool or even fail to utilize it entirely.

Therefore, this initial example focuses on the get_todo_issue() tool.
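
A minimal sketch of what a single-purpose tool like get_todo_issue() could look like, assuming the issues are already available as a list of dicts. The field names (id, title, state) and the sample data are hypothetical and are not taken from the repository or from the linear.app API. In smolagents, a function like this would typically be registered with the @tool decorator.

```python
from typing import Optional

# Hypothetical in-memory issues; a real tool would fetch these from linear.app.
SAMPLE_ISSUES = [
    {"id": "ISS-1", "title": "Write README", "state": "Done"},
    {"id": "ISS-2", "title": "Fix webhook handler", "state": "Todo"},
    {"id": "ISS-3", "title": "Add tests", "state": "Todo"},
]


def get_todo_issue(issues: Optional[list] = None) -> Optional[dict]:
    """Return the first issue whose state is "Todo", or None if there is none.

    Args:
        issues: List of issue dicts (assumed shape); defaults to SAMPLE_ISSUES.
    """
    source = SAMPLE_ISSUES if issues is None else issues
    for issue in source:
        if issue.get("state") == "Todo":
            return issue
    return None
```

Returning one issue per call keeps the tool's contract small, which may suit smaller models better than tools that expect several issues to be handled at once.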
posted an update 4 days ago
A dataset of 50 instrumental music tracks generated with the DiffRhythm model, using 10 CC0-licensed instrument samples from OpenGameArt.
Akjava/diffrhythm-instrument-cc0-oepngamearg-10x5-generated

I've released the dataset. It's a little skewed towards certain types of music. It might be interesting for people curious about the range of variations it can generate. It could also be a good starting point for experimenting with the Distrill model. I believe the quality is good enough to be used as background music for YouTube videos or probably as reference tracks for YuE or Udio.
posted an update 6 days ago
First Example of Direct Webhook-triggered AI Agent
Akjava/linear-app-webhook-smolagents

This Space's code might be helpful as a reference if you want to receive issue changes from linear.app via webhook and handle them with AI, using Gradio on Hugging Face Spaces or locally.

Imagine an agent, responding instantly.

In short, Hugging Face published _webhooks_server.py under Apache 2.0, and I've adapted it to work with a very small part of linear.app.

https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/_webhooks_server.py
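
As a rough illustration of the handling step, the sketch below turns a webhook request body into a short text that an agent could act on. The payload fields used here ("type", "action", and a nested "data" object with a "title") are assumptions for illustration; check the linear.app webhook documentation for the actual schema. The function name describe_issue_event is hypothetical and does not come from the Space.

```python
import json


def describe_issue_event(raw_body: bytes) -> str:
    """Summarize a webhook body as one line of text for an agent prompt."""
    payload = json.loads(raw_body)
    # Assumed payload fields: "type", "action", and a nested "data" object.
    if payload.get("type") != "Issue":
        return "ignored: not an issue event"
    data = payload.get("data", {})
    title = data.get("title", "(no title)")
    return f"issue {payload.get('action', 'unknown')}: {title}"
```

Keeping this step a pure function makes it easy to test without running a server; the webhook endpoint only needs to pass the raw request body in.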

Be careful

This method can be used as an AI-powered content management system, but I'm not sure if Hugging Face will allow it.

Github
https://github.com/akjava/smolagents-examples
replied to their post 7 days ago

Sorry if there is any misunderstanding; English is not my native language.
This is an issue between me and huggingface.co.

reacted to m-ric's post with 👍 12 days ago
We now have a Deep Research for academia: SurveyX automatically writes academic surveys nearly indistinguishable from human-written ones 🔥

Researchers from Beijing and Shanghai just published the first application of a deep research system to academia: their algorithm, given a question, can give you a survey of all papers on the subject.

To make a research survey, you generally follow two steps, preparation (collect and organize papers) and writing (outline creation, writing, polishing). Researchers followed the same two steps and automated them.

🎯 For the preparation part, a key step is finding all the important references on the given subject.
Researchers first cast a wide net of all relevant papers. But then finding the really important ones is like distilling knowledge from a haystack of information. To solve this challenge, they built an "AttributeTree" object that structures key information from citations. Ablating these AttributeTrees significantly decreased structure and synthesis scores, so they were really useful!

๐Ÿ“ For the writing part, key was to get a synthesis that's both short and true. This is not easy to get with LLMs! So they used methods like LLM-based deduplication to shorten the too verbose listings made by LLMs, and RAG to grab original quotes instead of made-up ones.

As a result, their system outperforms previous approaches by far!

As assessed by LLM judges, the quality score of SurveyX even approaches that of human experts, with 4.59/5 vs 4.75/5 🏆

I advise you to read the paper, it's a great overview of the kind of assistants that we'll get in the near future! 👉 SurveyX: Academic Survey Automation via Large Language Models (2502.14776)
Their website shows examples of generated surveys 👉 http://www.surveyx.cn/
posted an update 22 days ago
reacted to nyuuzyou's post with ❤️ about 2 months ago
🎨 Artfol Dataset - nyuuzyou/artfol

A collection of 1,892,816 artwork posts featuring:
- High-quality art pieces with various styles and techniques
- Complete metadata including artist IDs, titles, and moderation flags
- Content from Artfol social media platform

The dataset contains:
- Public domain artwork posts
- Artist attribution and identifiers
- Direct image URLs and web page links
- Content safety flags (NSFW, gore)
- Post titles and descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.
posted an update about 2 months ago
I've released some Spaces that demonstrate more advanced uses of MediaPipe landmarks.

Head-pose-estimate
The original MediaPipe detection works well at small angles. The trained model seems to work, and there is still room for improvement.
Akjava/mediapipe-head-pose-estimation

generate-3d-head:gltf
This is a simple, initial version.
Akjava/mediapipe-face-mesh-3d
Akjava/mediapipe-head-2d-spinning

prototype-faceswap
Color adjustment and transformation work so-so; I still need to find a way to keep facial features.
Akjava/mediapipe-face-skin-transform
reacted to davidberenstein1957's post with 🔥 3 months ago
Let's make a generation of amazing image-generation models

The best image generation models are trained on human preference datasets, where annotators have selected the best image from a choice of two. Unfortunately, many of these datasets are closed source so the community cannot train open models on them. Let's change that!

The community can contribute image preferences for an open-source dataset that could be used for building AI models that convert text to image, like the flux or stable diffusion families. The dataset will be open source so everyone can use it to train models that we can all use.

Blog: https://huggingface.co/blog/burtenshaw/image-preferences
posted an update 4 months ago
posted an update 4 months ago
Finally, I released the MediaPipe face animation Space.

Mediapipe 68-points Eyes-Closed and Mouth-Opened
Akjava/mediapipe-68-facial-guide-eyes-closed-mouth-opened

[Article] Results: Converted Guide Images (eyes-closed and mouth-opened) with Flux.1 schnell img2img/inpaint
https://huggingface.co/blog/Akjava/result-guide-image-eyes-mouth

All the other tools listed are designed to support Mediapipe Face Animation

Akjava/mediapipe-tools-672ffe8ee7b62763c31b70c7

Akjava/webp-3-frame-talking-animation-tools-672819ce4989f354cdbcc739
posted an update 4 months ago
Hi all, I just shared some Spaces and an article.

The key feature is the MediaPipe Face Landmarker,
which is Apache-licensed and trained on its own dataset.
It is a permissively licensed choice for those who use Flux.1 schnell instead of Dev.

[Spaces]
Mediapipe Face detect
Akjava/mediapipe-face-detect

Face crop and replace
Akjava/mediapipe-face-crop-and-replace

Mediapipe 68 landmark
Akjava/mediapipe-68-points-facial-landmark

Mediapipe 68 Face Mask
Akjava/mediapipe-68-points-facial-mask

[Articles]
Better img2img results with Flux.1 schnell by using ScaleUp or Sharpen or FillColor pre-processing
https://huggingface.co/blog/Akjava/img2img-pre-processing
posted an update 4 months ago
I've released several new Hugging Face Spaces.

My primary objective is to create consistent character facial animation using image-to-image techniques:

Akjava/CreateConsistentCharacterFacialAnimationWithImg2Img

A short-term goal is to create a simple talking-head animation.

WebP-3-Frame-Talking-Animation
Akjava/AIDiagramChatWithVoice-FaceCharacter

[Space]

- GPU tools
Flux1-schnell img2img
Akjava/flux1-schnell-img2img

Flux1-schnell Inpaint with mask-file
Akjava/flux1-schnell-img2img

- Tiny CPU tools
WebP-3F-TH - create webp animation from 3 images
OpenCV-Inpaint - classic inpaint
Whitebalance - simple white balance
Paste Image - just paste image with mask
WebP Resize Convert - resize and convert webp-animation
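
To illustrate the "paste image with mask" idea, here is a minimal sketch of per-pixel masked blending on grayscale images stored as nested lists. This shows the general technique only and is not the code used in the Space; the function name paste_with_mask and the 0-255 mask convention are assumptions.

```python
def paste_with_mask(base, overlay, mask):
    """Blend overlay onto base per pixel: mask 255 takes overlay, 0 keeps base."""
    result = []
    for base_row, over_row, mask_row in zip(base, overlay, mask):
        row = []
        for b, o, m in zip(base_row, over_row, mask_row):
            alpha = m / 255.0  # Convert the mask value to a 0.0-1.0 weight.
            row.append(round(o * alpha + b * (1.0 - alpha)))
        result.append(row)
    return result
```

Intermediate mask values give a soft edge, which can help hide the seam when a processed face region is pasted back into the original frame.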
posted an update 6 months ago
Streaming Text-to-Speech Chat Demo (CPU Inference Client)

Akjava/mistral-7b-v0.3-matcha-tts-en

Please be patient, as it may take over a minute to load the ONNX model.

This demo utilizes an inference client, which may occasionally become unresponsive.