Adina Yakefu's picture

Adina Yakefu

AdinaY

·

AI & ML interests

None yet

Recent Activity

reacted to Kseniase's post with 🔥 about 13 hours ago

15 types of attention mechanisms Attention mechanisms allow models to dynamically focus on specific parts of their input when performing tasks. In our recent article, we discussed Multi-Head Latent Attention (MLA) in detail and now it's time to summarize other existing types of attention. Here is a list of 15 types of attention mechanisms used in AI models: 1. Soft attention (Deterministic attention) -> https://huggingface.co/papers/1409.0473 Assigns a continuous weight distribution over all parts of the input. It produces a weighted sum of the input using attention weights that sum to 1. 2. Hard attention (Stochastic attention) -> https://huggingface.co/papers/1508.04025 Makes a discrete selection of some part of the input to focus on at each step, rather than attending to everything. 3. Self-attention -> https://huggingface.co/papers/1706.03762 Each element in the sequence "looks" at other elements and "decides" how much to borrow from each of them for its new representation. 4. Cross-Attention (Encoder-Decoder attention) -> https://huggingface.co/papers/2104.08771 The queries come from one sequence and the keys/values come from another sequence. It allows a model to combine information from two different sources. 5. Multi-Head Attention (MHA) -> https://huggingface.co/papers/1706.03762 Multiple attention “heads” are run in parallel. The model computes several attention distributions (heads), each with its own set of learned projections of queries, keys, and values. 6. Multi-Head Latent Attention (MLA) -> https://huggingface.co/papers/2405.04434 Extends MHA by incorporating a latent space where attention heads can dynamically learn different latent factors or representations. 7. Memory-Based attention -> https://huggingface.co/papers/1503.08895 Involves an external memory and uses attention to read from and write to this memory. See other types in the comments 👇

reacted to aifeifei798's post with 👀 1 day ago

一个加入水印的小程序 ```python from PIL import Image, ImageDraw, ImageFont def add_watermark(image): watermark_text = "AI Generated by DarkIdol FeiFei" # Ensure the input is an Image object if not isinstance(image, Image.Image): raise ValueError("Input must be a PIL Image object") width, height = image.size # Create a drawing object to draw on the image draw = ImageDraw.Draw(image) # Set the font size for the watermark text font_size = 10 # Set font size to 10 try: # Try to use a common font file font = ImageFont.truetype("Iansui-Regular.ttf", font_size) except IOError: # Use the default font if the specified font file is not found font = ImageFont.load_default() # Calculate the width and height of the watermark text using textbbox bbox = draw.textbbox((0, 0), watermark_text, font=font) text_width = bbox[2] - bbox[0] text_height = bbox[3] - bbox[1] # Calculate the position for the watermark text (bottom-right corner) x = width - text_width - 10 # 10 is the right margin y = height - text_height - 10 # 10 is the bottom margin # Add the watermark text to the image draw.text((x, y), watermark_text, font=font, fill=(255, 255, 255, 128)) # Return the modified image object return image ``` - 字体从https://fonts.google.com去找就可以了,程序都标注清楚了,自行修改

updated a Space 3 days ago

zh-ai-community/china-ai-policy-research

View all activity

Organizations

AdinaY's activity

upvoted a collection 4 days ago

🎬 Video model 2025

6 items • Updated 4 days ago • 3

upvoted a paper 4 days ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 7 days ago • 30

upvoted a collection 4 days ago

Open-Sora 2.0

3 items • Updated 5 days ago • 10

upvoted 3 papers 5 days ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 7 days ago • 34

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 6 days ago • 56

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 6 days ago • 91

upvoted 2 articles 5 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

5 days ago

• 284

Article

Open R1: Update #3

By

and 9 others •

6 days ago

• 233

upvoted a collection 5 days ago

🔊 Audio model 2025

6 items • Updated 5 days ago • 4

upvoted a paper 5 days ago

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 10 days ago • 32

upvoted a paper 6 days ago

WritingBench: A Comprehensive Benchmark for Generative Writing

Paper • 2503.05244 • Published 10 days ago • 15

upvoted a paper 7 days ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 11 days ago • 14

upvoted 2 papers 10 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 11 days ago • 94

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

upvoted a collection 10 days ago

MegaPairs

5 items • Updated 13 days ago • 4

upvoted a paper 11 days ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published 15 days ago • 58

upvoted 3 papers 13 days ago

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published 14 days ago • 26

Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions

Paper • 2503.00501 • Published 16 days ago • 11

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 14 days ago • 66

upvoted a paper 17 days ago

BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving

Paper • 2502.03438 • Published Feb 5 • 2