VILARIN

vilarin

AI & ML interests

Pantheon

Recent Activity

posted an update 1 day ago
updated a Space 1 day ago
vilarin/lumiere
upvoted an article 2 days ago

Organizations

vilarin's activity

posted an update 1 day ago
Post
704
🏄‍♂️ While browsing new models, I stumbled upon Lumiere from aixonlab. After testing it, I feel it has considerable potential. Keep up the good work!

Lumiere Alpha is a model focusing on improving realism without compromising prompt coherency or changing the composition completely from the original Flux.1-Dev model.

🦄 Model: aixonlab/flux.1-lumiere-alpha

🦖 Demo: vilarin/lumiere
  • 1 reply
reacted to merve's post with 👀 30 days ago
reacted to merve's post with 🔥 3 months ago
Post
5508
I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval 📖 it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation 💬 directly feed images as is to a vision language model with no processing to text!
I used the ColPali implementation of the new 🐭 Byaldi library by @bclavie 🤗
https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb
reacted to clem's post with 🔥 3 months ago
posted an update 3 months ago
Post
6011
🤩 Amazing day. AWPortrait-FL is finally here!
🦖 AWPortrait-FL is finetuned on FLUX.1-dev using the training set of AWPortrait-XL and nearly 2,000 fashion photography photos with extremely high aesthetic quality.

🤗 Model: Shakker-Labs/AWPortrait-FL

🙇 Demo: vilarin/flux-labs

posted an update 3 months ago
posted an update 4 months ago
Post
4186
Black Forest Labs, BASED!
FLUX.1 is more delightful, with good instruction following.
FLUX.1 dev (black-forest-labs/FLUX.1-dev) is a 12B-parameter distilled model, second only to Black Forest Labs' state-of-the-art model FLUX.1 pro. 🙀

Update 🤙 Official demo:
black-forest-labs/FLUX.1-dev
  • 1 reply
replied to merve's post 6 months ago

Thank you :) I updated the demo to support files.

reacted to merve's post with ❤️ 6 months ago
Post
2736
THUDM has released GLM-4V-9B and it's.. chatty! 😂
I asked it to describe my favorite Howl's Moving Castle scene and here's how it went 👇🏻

jokes aside, it seems to outperform the previous VLMs. however, the license isn't open-source 📈
model repo: THUDM/glm-4v-9b
a community member has built a demo: vilarin/VL-Chatbox
  • 1 reply