SACHIN KUMAR

DART-Math: Difficulty-Aware Rejection Tuning of LLMs for better Mathematical Problem-Solving
Previous works usually synthesize data from proprietary models to augment existing datasets, followed by instruction tuning to achieve… (5d ago)

Decoupled Refusal Training for improving Safety in LLMs
In this paper [1], the authors introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to… (Jul 16)

Speculative RAG: enhancing RAG with multiple drafts generation and verification
Recent RAG advancements focus on improving retrieval outcomes through iterative LLM refinement or self-critique capabilities acquired… (Jul 14)

Lookback-lens: Detect and Mitigate Hallucinations in LLMs with Attention Maps
When summarizing articles or answering questions for a given passage, LLMs can hallucinate details and respond with inaccurate or… (Jul 10)

AutoDetect: Framework for Automated Weakness Detection in LLMs across various tasks
LLMs do show superiority in accomplishing certain tasks, but still exhibit significant yet subtle weaknesses, such as mistakes in… (Jun 26)

Instruction Pre-Training: using instruction-response pairs to pre-train LLMs
Supervised multitask learning holds significant promise, as scaling it in the post-training stage trends towards better generalization. (Jun 23)

GraphReader: a graph-based Agent to enhance long-context abilities of LLMs
Long-context capabilities in LLMs help in tackling complex and long-input tasks, but have been hindered by challenges that persist in… (Jun 23)

Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
LLM-as-a-judge has emerged as an approach for evaluating large language models (LLMs), but still has many open questions about the… (Jun 20)

Meta-Reasoning Prompting: efficient system prompting method for LLMs inspired by human…
Traditional in-context learning-based reasoning techniques, such as Tree-of-Thoughts, show promise but lack consistent state-of-the-art… (Jun 18)

Mitigating Memorization in Generative LLMs to prevent training data leaks in responses
LLMs can memorize and repeat their training data, causing privacy and copyright risks. To mitigate memorization, the authors of this paper [1]… (Jun 18)