MantisNLP
Tagged in
Inference
MantisNLP
We are an AI consultancy focused on Natural Language Processing. We write about machine learning and natural language processing, including chatbots, open source software, and more…
Followers: 161
Andrei Apostol in MantisNLP, Nov 8, 2023
Knowledge Distillation — Techniques for Efficient Inference of LLMs (IV/IV)
Andrei Apostol in MantisNLP, Nov 1, 2023
FlashAttention — Techniques for Efficient Inference of LLMs (III/IV)
Andrei Apostol in MantisNLP, Oct 18, 2023
Techniques for Efficient Inference of LLMs (II/IV)
Andrei Apostol in MantisNLP, Oct 3, 2023
Techniques for Efficient Inference of LLMs (I/IV)
Introduction