Mitra

NeelM0906

AI & ML interests

None yet

Organizations

None yet

NeelM0906's activity

updated a Space about 1 month ago

GaifeAgrilla

Agrilla framework to generate data

liked a Space 2 months ago

Running on Zero

Phi Vision Math Assistant

upvoted a collection 2 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 78

updated a model 2 months ago

NeelM0906/GaifeLM

liked a dataset 2 months ago

SkunkworksAI/reasoning-0.01

Viewer • Updated Sep 14 • 29.9k • 447 • 268

New activity in unsloth/gemma-2-9b 2 months ago

AttributeError: 'NoneType' object has no attribute 'store_cubin'

#1 opened 2 months ago by

updated a model 2 months ago

NeelM0906/gemma_block_selection

Updated Sep 17 • 8

liked a model 2 months ago

unsloth/gemma-2-9b

Text Generation • Updated Sep 3 • 5.26k • 10

updated a dataset 2 months ago

NeelM0906/Block_Selection

Viewer • Updated Sep 14 • 500 • 35

liked a model 2 months ago

unsloth/gemma-7b-bnb-4bit

Text Generation • Updated Sep 3 • 3.32k • 17

New activity in unsloth/gemma-7b-bnb-4bit 2 months ago

No module named 'triton'

#3 opened 2 months ago by

Reacted to tomaarsen's post with 🔥 2 months ago

🚀 Sentence Transformers v3.1 is out! It features a hard negatives mining utility to get better models out of your data, a strong new loss function, training with streaming datasets, custom modules, bug fixes, small additions, and documentation changes. Here are the details:

⛏ Hard Negatives Mining Utility: Hard negatives are texts that are rather similar to some anchor text (e.g. a question), but are not the correct match. They're difficult for a model to distinguish from the correct answer, often resulting in a stronger model after training.
📉 New loss function: This loss function works very well for symmetric tasks (e.g. clustering, classification, finding similar texts/paraphrases) and a bit less so for asymmetric tasks (e.g. question-answer retrieval).
💾 Streaming datasets: You can now train with datasets.IterableDataset, which doesn't require downloading the full dataset to disk before training. It is as simple as passing "streaming=True" to "datasets.load_dataset".
🧩 Custom Modules: Model authors can now customize a lot more of the components that make up Sentence Transformer models, allowing for a lot more flexibility (e.g. multi-modal, model-specific quirks, etc.)
✨ New arguments to several methods: encode_multi_process gets a progress bar, push_to_hub can now be done to different branches, and CrossEncoders can be downloaded to specific cache directories.
🐛 Bug fixes: Too many to name here, check out the release notes!
📝 Documentation: A particular focus on clarifying the batch samplers in the Package Reference this release.
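
To make the hard negatives idea from the list above concrete, here is a minimal pure-Python sketch. It is not the utility shipped in Sentence Transformers. The function name mine_hard_negatives_sketch, the toy 2-D vectors, and the use of plain cosine similarity are all assumptions chosen for illustration. For each anchor, it ranks the candidates that are not the known positive by similarity and keeps the closest ones.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mine_hard_negatives_sketch(anchor_vecs, candidate_vecs, positive_idx, num_negatives=1):
    """Hypothetical helper: for each anchor, return the indices of the
    candidates most similar to it, excluding its known positive."""
    mined = []
    for i, anchor in enumerate(anchor_vecs):
        scored = [
            (cosine(anchor, cand), j)
            for j, cand in enumerate(candidate_vecs)
            if j != positive_idx[i]  # never pick the correct match as a negative
        ]
        scored.sort(reverse=True)  # most similar first
        mined.append([j for _, j in scored[:num_negatives]])
    return mined


# Toy embeddings (assumed values); positives[i] is the index of the
# correct candidate for anchors[i].
anchors = [[1.0, 0.0], [0.0, 1.0]]
candidates = [[0.9, 0.1], [0.8, 0.3], [0.1, 0.9], [-1.0, 0.0]]
positives = [0, 2]

hard_negatives = mine_hard_negatives_sketch(anchors, candidates, positives)
print(hard_negatives)
```

In practice the vectors would come from an embedding model, and a real mining utility would typically add filters such as similarity thresholds so that unlabeled true positives are not picked as negatives.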

Check out the full release notes here ⭐: https://github.com/UKPLab/sentence-transformers/releases/tag/v3.1.0

I'm very excited to hear your feedback, and I'm looking forward to the future changes that I have planned, such as ONNX inference! I'm also open to suggestions for new features: feel free to send me your ideas.

liked a model 2 months ago

appvoid/arco

Updated Sep 19 • 61 • 12

liked a Space 3 months ago

Running on CPU Upgrade

Kolors Virtual Try-On

upvoted a paper 3 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4 • 27

New activity in NeelM0906/Workflow_Selection 3 months ago

[bot] Conversion to Parquet

#1 opened 3 months ago by

parquet-converter

New activity in unsloth/mistral-7b-instruct-v0.3 3 months ago

ValueError: The following `model_kwargs` are not used by the model: ['num_logits_to_keep'] (note: typos in the generate arguments will also show up in this list)

#1 opened 3 months ago by

liked a model 3 months ago

unsloth/mistral-7b-instruct-v0.3

Text Generation • Updated Sep 11 • 6.07k • 4

updated a dataset 3 months ago

NeelM0906/Workflow_Selection

Viewer • Updated Sep 4 • 5k • 34

liked a dataset 4 months ago

Salesforce/xlam-function-calling-60k

Viewer • Updated Jul 19 • 60k • 2.71k • 388