Ling Huang | Large Language Models — Retrieval Augmented Generation (RAG), Part 7 | RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension | 3d ago
Ling Huang | User Life Time Value | For user life cycle analysis and methodology, it defines the two core goals and four levers to establish a data analysis system. It also… | 4d ago
Ling Huang | MLOps — ML/DL Model Deployment | Model deployment is the process of making a machine learning model accessible to someone or something else. | 4d ago
Ling Huang | Large Language Models — Retrieval Augmented Generation (RAG), Part 6 | 12 RAG Pain Points and Proposed Solutions to the core challenges of RAG | Jul 11
Ling Huang | Large Language Model — LLM Model Efficient Inference | A Survey on Efficient Inference for Large Language Models | Jul 11
Ling Huang | Large Language Model — LLM Model Infra | Mainstream LLM architecture. Common architecture types: | Jul 2
Ling Huang | Large Language Model — LLM Agents, Part 4 | An AI Agent is a piece of software that performs tasks on behalf of a user. They can automate processes, make decisions, and interact… | Jun 30
Ling Huang | Large Language Models — Fine Tuning | Ten Commandments to Deploy Fine-Tuned Models in Prod | Jun 28
Ling Huang | Large Language Models — Retrieval Augmented Generation (RAG), Part 5 | Intro of Embedding | Jun 19
Ling Huang | FP8 from NVIDIA | In order to better understand FP8, this article will focus on four issues and goals to explain to you: | Jun 19