@KingNish on Hugging Face: "Introducing OpenGPT-4o https://huggingface.co/spaces/KingNish/OpenGPT-4o…"

KingNish

posted an update May 14, 2024

Introducing OpenGPT-4o
KingNish/OpenGPT-4o

Features:
1️⃣ Supported inputs: Text ✏️, Text + Image 📝🖼️, Audio 🎧, Webcam 📸
Supported outputs: Image 🖼️, Image + Text 🖼️📝, Text 📝, Audio 🎧
2️⃣ 100% free 💸 and super-fast ⚡.
3️⃣ Publicly available before GPT-4o.

Future Features:
1️⃣ Chat with PDF (Both voice and text)
2️⃣ Video generation.
3️⃣ Sequential Image Generation.
4️⃣ Better UI and customization.

Note: It's not possible to match GPT-4o's level of complexity, because OpenAI has been developing GPT-4o for six months with a team of more than 450 experienced members, whereas I am only one person. Moreover, they haven't released it fully to the public, so it remains a test model.

julien-c

May 14, 2024

this is working quite well!

osanseviero

May 14, 2024

I tried with the OAI example and it worked nicely! https://huggingface.co/spaces/KingNish/GPT-4o/discussions/1

Neilblaze

May 14, 2024

This is amazing!

victor

May 14, 2024

Out of curiosity, did you use dev mode while building it?

KingNish

May 14, 2024

Yes, but how did you know?

PeepDaSlan9

May 14, 2024

I tried it

KingNish

May 15, 2024

Any suggestions?

AshScholar

May 17, 2024

This comment has been hidden

alybadara1803

May 17, 2024

What model did you use to build it?
And would it be possible to write a blog post on how you made it?

KingNish

May 18, 2024

edited May 20, 2024

Super Chat Model - Idefics2
Image Generation Model - Pollinations AI API
Speech to Text - Nemo (API)
Voice Chat (Base Model) - Mixtral 8x7B (Inference API)
Text to Speech - Edge TTS (API)
Live Chat (Base Model) - UForm-Gen2 DPO
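
For readers wondering how the voice chat pieces listed above might fit together, here is a minimal sketch in Python. It is not the actual OpenGPT-4o code: the `VoiceChatPipeline` class and the three `fake_*` functions are hypothetical stand-ins, assuming only that speech to text, the chat model, and text to speech are called one after another. In a real app each stand-in would be replaced by a client for the corresponding service.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceChatPipeline:
    """Hypothetical three-stage chain: speech to text, chat model, text to speech."""

    transcribe: Callable[[bytes], str]   # stand-in for a speech-to-text client
    generate: Callable[[str], str]       # stand-in for a chat model client
    synthesize: Callable[[str], bytes]   # stand-in for a text-to-speech client

    def reply(self, audio_in: bytes) -> bytes:
        # Run the stages in order, passing each output to the next stage.
        text_in = self.transcribe(audio_in)
        text_out = self.generate(text_in)
        return self.synthesize(text_out)


# Offline stand-ins so the sketch runs without any network access.
def fake_transcribe(audio: bytes) -> str:
    # Pretend the audio bytes are already UTF-8 text.
    return audio.decode("utf-8")


def fake_generate(prompt: str) -> str:
    # Echo the prompt instead of calling a language model.
    return f"You said: {prompt}"


def fake_synthesize(text: str) -> bytes:
    # Pretend the encoded text is the synthesized audio.
    return text.encode("utf-8")


pipeline = VoiceChatPipeline(fake_transcribe, fake_generate, fake_synthesize)
audio_out = pipeline.reply(b"hello")
```

Keeping each stage as a plain callable means any one of them can be swapped (for example, a different text-to-speech backend) without touching the others.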