Aseet Patel PRO

Aseetpatel21

AI & ML interests

AI LegalTech for US patent prosecution

Recent Activity

liked a model about 1 month ago
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
upvoted a collection about 1 month ago
Llama-3.1-Nemotron-70B
liked a Space about 1 month ago
numind/NuExtract-1.5
View all activity

Organizations

PatentOAResponse.com's profile picture Hugging Face for Legal's profile picture

Aseetpatel21's activity

updated a collection 4 months ago
Reacted to dhuynh95's post with ❤️ 4 months ago
view post
Post
✅New paper to ensure valid LLM output with SOTA LLMs like GPT4 by mixing it with OSS LLMs

Paper: arxiv.org/abs/2401.09967

Great paper showing how strong proprietary AI like #GPT4 can be paired with #OSS LLM to ensure LLM output validity, e.g. valid JSON.

Many devs complain that #LLMs cannot be reliably used in production if the output is not valid, for instance, if one wants to use LLMs to generate SQL queries or JSON, it is crucial that the output is valid.

Frameworks have arisen to constrain the outputs of the LLM to follow some constraints, like outlines (https://github.com/outlines-dev/outlines), but they assume access to logits.

This makes them incompatible with proprietary LLMs like GPT4 that don’t share logits, so one can only use open-source LLMs that are much less performant.

This paper shows how can use powerful proprietary LLMs like GPT4 to create a first unconstrained sketch and refine it using an OSS model like Llama 2 where logits are accessible, to rewrite the sketch following some specific constraints.

They show that GPT4 Precision can be increased by 14% (43% before, 57% after), by boosting it with constrained output on information extraction on Wiki-NRE!
updated a collection 4 months ago
liked a Space 4 months ago
upvoted an article 6 months ago
view article
Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

103