Jayan Kesavan

jayan12k

AI & ML interests

Young developer. Interested in advancing every aspect of artificial intelligence and making new discoveries while being fully open source. Currently working on an advanced language model from scratch.

Recent Activity

liked a dataset 4 days ago

Skylion007/openwebtext

liked a dataset 4 days ago

omegalabsinc/omega-multimodal

liked a model 4 days ago

Wan-AI/Wan2.1-T2V-14B

Organizations

None yet

jayan12k's activity

liked 2 datasets 4 days ago

Skylion007/openwebtext

Updated May 17, 2024 • 86.8k • 400

omegalabsinc/omega-multimodal

Viewer • Updated less than a minute ago • 117M • 117k • 47

liked 2 models 4 days ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 1 day ago • 207k • 1.01k

Lightricks/LTX-Video

Text-to-Video • Updated about 21 hours ago • 153k • 1.07k

New activity in jayan12k/Finecode 4 days ago

Release date?

#2 opened 8 days ago by

orionweller

liked 2 models 8 days ago

apple/OpenELM-3B-Instruct

Text Generation • Updated 13 days ago • 11k • 330

apple/DepthPro-hf

Depth Estimation • Updated 13 days ago • 33.1k • 43

liked 2 datasets 10 days ago

tiiuae/falcon-refinedweb

Viewer • Updated Jun 20, 2023 • 968M • 53.5k • 841

HuggingFaceFW/fineweb-edu-score-2

Viewer • Updated Jan 31 • 13.1B • 162k • 72

reacted to mkurman's post with 👍 10 days ago

Post

3670

Introducing a new architecture, MedIT One – a single-token transformer with LSTM-like recurrence.

It is extremely fast in training and inference, but we lack funding for large-scale training. Enjoy 🍓

https://github.com/MedITSolutionsKurman/medit-one

reacted to Kseniase's post with 🔥 10 days ago

Post

6070

9 types of "Chain-of-..." approaches:

Chain-of-Thought (CoT) prompting enhances reasoning in AI models by breaking down complex problems into step-by-step logical sequences. It continues to prove its effectiveness, especially in top-performing reasoning models. However, there are other similar methods that expand on CoT and can be used for different purposes. Here are 9 of them:

1. Chain-of-Action-Thought (COAT) -> Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search (2502.02508)
Helps the model decide when to keep thinking, double-check its work, or try a different approach, using special guiding tokens.

2. Chain of Draft (CoD) -> Chain of Draft: Thinking Faster by Writing Less (2502.18600)
Helps the model generate short but meaningful reasoning steps, cutting costs and making processing faster.

3. Chain-of-Agents -> Chain of Agents: Large Language Models Collaborating on Long-Context Tasks (2406.02818)
Uses multi-agent collaboration: worker agents process parts of the text in a structured chain, and a manager agent summarizes the results.

4. Chain-of-RAG -> https://huggingface.co/papers/2501.14342
Creates retrieval chains instead of retrieving all info at once. It can dynamically adjust its search process and parameters such as the number of steps.

5. Chain-of-Shot Prompting (CoS) -> CoS: Chain-of-Shot Prompting for Long Video Understanding (2502.06428)
Helps models pick the frames crucial for understanding a video, using a binary video summary and a video co-reasoning module.

6. Chain of Hindsight (CoH) -> Chain of Hindsight Aligns Language Models with Feedback (2302.02676)
Converts all feedback into sequences to fine-tune the model and refine its outputs.

7. Chain-of-Note (CoN) -> Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models (2311.09210)
Generates sequential reading notes for each retrieved document to assess relevance before integrating info into the final answer.

8. Chain of Diagnosis (CoD) -> CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis (2407.13301)
Transforms the diagnostic process into a diagnostic chain.

9. Chain(s)-of-Knowledge -> https://www.turingpost.com/p/cok
Enhances LLMs by dynamically pulling in external knowledge to improve accuracy and reduce errors.
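
To make the worker/manager idea from item 3 (Chain-of-Agents) a bit more concrete, here is a minimal sketch in Python. It is not the paper's implementation: the function names (split_into_chunks, chain_of_agents), the prompt wording, the word-based chunking, and the llm callable are all assumptions made for illustration. Any function that maps a prompt string to a response string can be plugged in as llm.

```python
from typing import Callable, List


def split_into_chunks(text: str, chunk_size: int) -> List[str]:
    """Split text into consecutive chunks of at most chunk_size words."""
    words = text.split()
    return [
        " ".join(words[i:i + chunk_size])
        for i in range(0, len(words), chunk_size)
    ]


def chain_of_agents(
    text: str,
    question: str,
    llm: Callable[[str], str],  # hypothetical: prompt in, response out
    chunk_size: int = 50,
) -> str:
    """Run worker agents over the chunks in order, then a manager agent.

    Each worker sees one chunk plus the notes passed along by the
    previous worker, and returns updated notes. The manager sees only
    the final notes and produces the answer.
    """
    notes = ""
    for chunk in split_into_chunks(text, chunk_size):
        worker_prompt = (
            f"Question: {question}\n"
            f"Notes so far: {notes}\n"
            f"New text: {chunk}\n"
            "Update the notes with anything relevant to the question."
        )
        notes = llm(worker_prompt)

    manager_prompt = (
        f"Question: {question}\n"
        f"Notes from the workers: {notes}\n"
        "Answer the question using only these notes."
    )
    return llm(manager_prompt)


if __name__ == "__main__":
    # Placeholder "model" that just reports the prompt length; swap in a
    # real model call to get meaningful output.
    def toy_llm(prompt: str) -> str:
        return f"(response to a {len(prompt)}-character prompt)"

    long_text = "word " * 200
    print(chain_of_agents(long_text, "What is this text about?", toy_llm))
```

The point of the design is that no single call has to fit the whole document into its context: each worker handles one chunk, and only the running notes travel down the chain to the manager.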