Diwank Tomer PRO

diwank

https://diwank.name

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

nvidia/OpenMath2-Llama3.1-70B-nemo

liked a dataset 1 day ago

nroggendorff/dictionary

updated a collection 1 day ago

Articles

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

Organizations

diwank's activity

upvoted a collection 1 day ago

InternVL 2.5

Better than InternVL 2.0 • 14 items • Updated 1 day ago • 7

upvoted 2 papers 4 days ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published 8 days ago • 50

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 15 days ago • 48

upvoted a paper 6 days ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published 8 days ago • 65

upvoted a paper 7 days ago

PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

Paper • 2411.08307 • Published 9 days ago • 6

upvoted a paper 8 days ago

Watermark Anything with Localized Messages

Paper • 2411.07231 • Published 11 days ago • 19

upvoted a paper 9 days ago

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published 10 days ago • 24

upvoted a paper 10 days ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 15 days ago • 34

upvoted a paper 11 days ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 91

upvoted a collection 14 days ago

LipSync and Face Operations

10 items • Updated 14 days ago • 23

upvoted a paper 22 days ago

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13 • 3

upvoted an article 23 days ago

Decoding Strategies in Large Language Models

Article • 24 days ago • 38

upvoted a collection 23 days ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11 • 6

upvoted a paper 23 days ago

The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

Paper • 2304.01412 • Published Apr 3, 2023 • 2

upvoted a collection 27 days ago

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text • 6 items • Updated Oct 21 • 1

upvoted a paper 27 days ago

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10 • 3

upvoted a collection 27 days ago

Mono-InternVL

A Pioneering Monolithic MLLM • 2 items • Updated Oct 21 • 4

upvoted an article about 1 month ago

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

Article • Oct 21 • 27

upvoted a paper about 1 month ago

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Paper • 2410.11190 • Published Oct 15 • 20

upvoted a collection about 1 month ago

Granite Guardian Models

A collection of models created by IBM for safeguarding language models. • 4 items • Updated 18 days ago • 13