Akjava (Akihito Miyazaki)

posted an update 3 days ago

Initial API-Based Smolagents and Linear.app Integration Example
Akjava/linear-app-api-smolagents
In short, this example contains a get_todo_issue() tool, plus add_comment() and change_state_reviewing() functions for linear.app.

Larger language models, such as 70B-parameter models, can often readily use tools such as add_comment or change_state, potentially handling multiple issues concurrently.

However, smaller models may require repeated calls to a tool or even fail to utilize it entirely.

Therefore, this initial example focuses on the get_todo_issue() tool.
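
A minimal sketch of what a get_todo_issue()-style helper might look like; this is not the Space's actual code. It assumes Linear's GraphQL endpoint at https://api.linear.app/graphql and a workflow state named "Todo". The names build_todo_query and parse_todo_issues are hypothetical, and the HTTP request itself is left out so the snippet runs offline.

```python
import json

# Assumption: Linear's public GraphQL endpoint. The request is not sent here.
LINEAR_GRAPHQL_URL = "https://api.linear.app/graphql"


def build_todo_query(state_name="Todo", limit=1):
    """Build a GraphQL request body asking for issues in one workflow state."""
    query = """
    query TodoIssues($state: String!, $limit: Int!) {
      issues(filter: {state: {name: {eq: $state}}}, first: $limit) {
        nodes { id identifier title description }
      }
    }
    """
    return {"query": query, "variables": {"state": state_name, "limit": limit}}


def parse_todo_issues(response_body):
    """Return the list of issue dicts from a GraphQL response (empty if none)."""
    data = json.loads(response_body).get("data") or {}
    return (data.get("issues") or {}).get("nodes") or []


# Hypothetical response, shaped like the query above.
sample = json.dumps(
    {"data": {"issues": {"nodes": [{"id": "abc", "title": "Write docs"}]}}}
)
issues = parse_todo_issues(sample)
```

Returning a single small list keeps the tool output short, which may matter for the smaller models mentioned above.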

posted an update 4 days ago

A dataset of 50 instrumental music tracks generated with the DiffRhythm model, using 10 CC0-licensed instrument samples from OpenGameArt.
Akjava/diffrhythm-instrument-cc0-oepngamearg-10x5-generated

I've released the dataset. It's a little skewed towards certain types of music. It might be interesting for people curious about the range of variations it can generate. It could also be a good starting point for experimenting with the Distrill model. I believe the quality is good enough to be used as background music for YouTube videos or probably as reference tracks for YuE or Udio.

posted an update 6 days ago

First Example of Direct Webhook-triggered AI Agent
Akjava/linear-app-webhook-smolagents

This Space's code might be helpful as a reference if you want to receive issue changes from linear.app via webhook and handle them with AI, using Gradio on Hugging Face Spaces or locally.

Imagine an agent, responding instantly.

In short, Hugging Face published _webhooks_server.py under Apache 2.0, and I've adapted it to work with a very small part of linear.app.

https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/_webhooks_server.py
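
A minimal sketch of the receiving side, independent of the Space's actual code. It assumes the webhook sender signs the raw request body with HMAC-SHA256 and sends the hex digest in a header (Linear documents a `Linear-Signature` header for this), and that the JSON payload carries `type`, `action`, and `data` fields. The function names are hypothetical.

```python
import hashlib
import hmac
import json


def verify_signature(raw_body: bytes, signature_hex: str, secret: str) -> bool:
    """Check an HMAC-SHA256 hex signature against the raw request body."""
    expected = hmac.new(secret.encode("utf-8"), raw_body, hashlib.sha256).hexdigest()
    # Constant-time comparison; both sides are compared as bytes.
    return hmac.compare_digest(expected.encode("utf-8"), signature_hex.encode("utf-8"))


def handle_issue_event(raw_body: bytes, signature_hex: str, secret: str):
    """Return (action, issue title) for a verified Issue event, else None."""
    if not verify_signature(raw_body, signature_hex, secret):
        return None
    payload = json.loads(raw_body)
    if payload.get("type") != "Issue":
        return None
    return payload.get("action"), (payload.get("data") or {}).get("title")
```

Rejecting unsigned or non-Issue events before any model call keeps the agent from reacting to arbitrary requests.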

Be careful

This method can be used as an AI-powered content management system, but I'm not sure if Hugging Face will allow it.

GitHub
https://github.com/akjava/smolagents-examples

replied to their post 7 days ago

Sorry if there is any misunderstanding; English is not my native language.
This is an issue between me and huggingface.co.

posted an update 11 days ago

reacted to m-ric's post with 👍 12 days ago

We now have a Deep Research for academia: SurveyX automatically writes academic surveys nearly indistinguishable from human-written ones 🔥

Researchers from Beijing and Shanghai just published the first application of a deep research system to academia: their algorithm, given a question, can give you a survey of all papers on the subject.

To make a research survey, you generally follow two steps, preparation (collect and organize papers) and writing (outline creation, writing, polishing). Researchers followed the same two steps and automated them.

🎯 For the preparation part, a key step is finding all the important references on the given subject.
Researchers first cast a wide net of all relevant papers. But then finding the really important ones is like distilling knowledge from a haystack of information. To solve this challenge, they built an “AttributeTree” object that structures key information from citations. Ablating these AttributeTrees significantly decreased structure and synthesis scores, so they were really useful!

📝 For the writing part, the key was to get a synthesis that's both short and true. This is not easy to get with LLMs! So they used methods like LLM-based deduplication to shorten the overly verbose listings made by LLMs, and RAG to grab original quotes instead of made-up ones.

As a result, their system outperforms previous approaches by far!

As assessed by LLM judges, the quality score of SurveyX even approaches that of human experts, with 4.59/5 vs 4.75/5 🏆

I advise you to read the paper, it's a great overview of the kind of assistants that we'll get in the near future! 👉 SurveyX: Academic Survey Automation via Large Language Models (2502.14776)
Their website shows examples of generated surveys 👉 http://www.surveyx.cn/

posted an update 22 days ago

I shared smolagents examples

Akjava/open_Deep-Research-DuckDuckGo
Akjava/open_Deep-Research-DuckDuckGo-Groq

Replacing each img src with "#" in mdconvert.py helps reduce tokens.
I also added translation of the final answer into my language.
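
The img src replacement could be sketched as follows. This is a standalone illustration using a regular expression, not the actual change made to mdconvert.py; the name strip_img_sources is hypothetical. It is meant to show why the trick saves tokens: long image URLs and inline base64 data collapse to a single character.

```python
import re

# Matches src="..." or src='...' inside an <img ...> tag.
IMG_SRC = re.compile(
    r'(<img\b[^>]*?\bsrc\s*=\s*)(["\'])(.*?)\2',
    re.IGNORECASE | re.DOTALL,
)


def strip_img_sources(html: str) -> str:
    """Replace every quoted <img> src value with '#', keeping other attributes."""
    return IMG_SRC.sub(r"\1\2#\2", html)


page = '<p><img alt="chart" src="data:image/png;base64,AAAA"></p>'
print(strip_img_sources(page))
# prints: <p><img alt="chart" src="#"></p>
```

Tags other than img, such as script, are left untouched by this pattern.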

reacted to nyuuzyou's post with ❤️ about 2 months ago

🎨 Artfol Dataset - nyuuzyou/artfol

A collection of 1,892,816 artwork posts featuring:
- High-quality art pieces with various styles and techniques
- Complete metadata including artist IDs, titles, and moderation flags
- Content from Artfol social media platform

The dataset contains:
- Public domain artwork posts
- Artist attribution and identifiers
- Direct image URLs and web page links
- Content safety flags (NSFW, gore)
- Post titles and descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.

posted an update about 2 months ago

I've released some Spaces that demonstrate more advanced uses of MediaPipe landmarks.

Head-pose-estimate
The original MediaPipe detection is good at small angles. The trained model seems to work, and there is still room for improvement.
Akjava/mediapipe-head-pose-estimation

generate-3d-head:gltf
This is a simple, initial version.
Akjava/mediapipe-face-mesh-3d
Akjava/mediapipe-head-2d-spinning

prototype-faceswap
Color adjustment and transformation work so-so; I need to find a way to keep facial features.
Akjava/mediapipe-face-skin-transform

reacted to davidberenstein1957's post with 🔥 3 months ago

Let’s make a generation of amazing image-generation models

The best image generation models are trained on human preference datasets, where annotators have selected the best image from a choice of two. Unfortunately, many of these datasets are closed source so the community cannot train open models on them. Let’s change that!

The community can contribute image preferences for an open-source dataset that could be used for building AI models that convert text to image, like the flux or stable diffusion families. The dataset will be open source so everyone can use it to train models that we can all use.

Blog: https://huggingface.co/blog/burtenshaw/image-preferences

posted an update 4 months ago

I wanted to move the eyes with Flux.1 schnell, but prompts failed. I made a guide image instead, which turned out to be surprisingly useful on its own. Inpaint/img2img works well with a lower strength.
Are rolling/white eyes with Flux.1 schnell viable? Wanted?
[space] Mediapipe Change Eyes Direction
Akjava/mediapipe-change-eyes-direction
[article] Eyes Slide-Move: Classic-Inpainting fill hole and complete missing iris
https://huggingface.co/blog/Akjava/eyes-slide-move

posted an update 4 months ago

Finally, I released the MediaPipe face animation Space.

Mediapipe 68-points Eyes-Closed and Mouth-Opened
Akjava/mediapipe-68-facial-guide-eyes-closed-mouth-opened

[Article] Results: Converted Guide Images (eyes-closed and mouth-opened) with Flux.1 schnell img2img/inpaint
https://huggingface.co/blog/Akjava/result-guide-image-eyes-mouth

All the other tools listed are designed to support Mediapipe Face Animation

Akjava/mediapipe-tools-672ffe8ee7b62763c31b70c7

Akjava/webp-3-frame-talking-animation-tools-672819ce4989f354cdbcc739

posted an update 4 months ago

Hi all, I just shared some Spaces and an article.

The key feature is the MediaPipe Face Landmarker:
it is Apache-licensed and trained on its own dataset.
It is a well-licensed option for those who use Flux.1 schnell instead of Dev.

[Spaces]
Mediapipe Face detect
Akjava/mediapipe-face-detect

Face crop and replace
Akjava/mediapipe-face-crop-and-replace

Mediapipe 68 landmark
Akjava/mediapipe-68-points-facial-landmark

Mediapipe 68 Face Mask
Akjava/mediapipe-68-points-facial-mask

[Articles]
Better img2img results with Flux.1 schnell by using ScaleUp or Sharpen or FillColor pre-processing
https://huggingface.co/blog/Akjava/img2img-pre-processing

posted an update 4 months ago

I've released several new Hugging Face Spaces.

My primary objective is to create consistent character facial animation using image-to-image techniques:

Akjava/CreateConsistentCharacterFacialAnimationWithImg2Img

A short-term goal is to create a simple talking-head animation.

WebP-3-Frame-Talking-Animation
Akjava/AIDiagramChatWithVoice-FaceCharacter

[Space]

- GPU tools
Flux1-schnell img2img
Akjava/flux1-schnell-img2img

Flux1-schnell Inpaint with mask-file
Akjava/flux1-schnell-img2img

- Tiny CPU tools
WebP-3F-TH - create webp animation from 3 images
OpenCV-Inpaint - classic inpaint
Whitebalance - simple white balance
Paste Image - just paste image with mask
WebP Resize Convert - resize and convert webp-animation

posted an update 6 months ago

Streaming Text-to-Speech Chat Demo (CPU Inference Client)

Akjava/mistral-7b-v0.3-matcha-tts-en

Please be patient, as it may take over a minute to load the ONNX model.

This demo utilizes an inference client, which may occasionally become unresponsive.
