Huiqiang Jiang – Medium

Huiqiang Jiang

Home

About

Pinned

Huiqiang Jiang
in
LlamaIndex Blog

LongLLMLingua: Bye-bye to Middle Loss and Save on Your RAG Costs via Prompt Compression

In the RAG, after the retrieval phase, it’s necessary to perform Re-ranking + Fine-Grained Prompt Compression + Subsequence Recovery to…

Nov 6, 2023

LongLLMLingua: Bye-bye to Middle Loss and Save on Your RAG Costs via Prompt Compression

Nov 6, 2023

Huiqiang Jiang

How to Optimize TTFT of 8B LLMs with 1M Tokens to 20s

If you aim to optimize an 8B model with a 1 million tokens TTFT (time-to-first-token) to 20 seconds, you might consider the following…

Jul 21

How to Optimize TTFT of 8B LLMs with 1M Tokens to 20s

Jul 21

Huiqiang Jiang

Huiqiang Jiang

RSDE @Microsoft Research SH

Following

The Medium Blog
LlamaIndex Blog
Ai2
Bill Yuchen Lin, PhD
Medium Staff

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams