Running a Large Language Model is computationally expensive and requires a lot of memory, but through research and optimizations, these limitations are starting to fade away, to the point where we can run a LLM on a device that is 5 years old.
Size and quality
Microsoft’s Phi models are an example of these advancements, defined by them as…