Abdel-Dayane Marcos

admarcosai

AI & ML interests

Natural Language Processing, Graph Neural Networks, Reinforcement Learning

Recent Activity

updated a collection about 10 hours ago

Pending Classification

Organizations

None yet

admarcosai's activity

commented a paper about 11 hours ago

Generative Agents: Interactive Simulacra of Human Behavior

Paper • 2304.03442 • Published Apr 7, 2023 • 12

commented 6 papers 10 months ago

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22

OneBit: Towards Extremely Low-bit Large Language Models

Paper • 2402.11295 • Published Feb 17 • 23

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 172

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15 • 36

Computing Power and the Governance of Artificial Intelligence

Paper • 2402.08797 • Published Feb 13 • 12

commented 4 papers 11 months ago

Efficiently Programming Large Language Models using SGLang

Paper • 2312.07104 • Published Dec 12, 2023 • 7

Scaling Laws for Downstream Task Performance of Large Language Models

Paper • 2402.04177 • Published Feb 6 • 17

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11 • 26

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

Paper • 2401.12947 • Published Jan 23 • 2

New activity in tongyx361/MathInstruct-Core-DifficultyAware 11 months ago

Meaning of err_rate in the dataset

#2 opened 11 months ago by

commented a paper about 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 138

New activity in deepseek-ai/deepseek-coder-6.7b-instruct about 1 year ago

Trained on Code Search Net

#5 opened about 1 year ago by

commented a paper about 1 year ago

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 24

New activity in google-research-datasets/natural_questions over 2 years ago

Natural Questions is not streamable

#1 opened over 2 years ago by

New activity in community-datasets/wiki_snippets over 2 years ago

why does loading load_dataset('wiki_snippets', name='wiki40b_en_100_0') takes 3 hours when it only generates 12GB of data

#1 opened over 2 years ago by