Open source, AutoGPT, Multimodel, and more…
2025-02-01
See what is trending on huggingface.co/
Most models are released in half-precision (16 bit):
See what is trending on huggingface.co/
The same models at 6-bit quantisation are:
This makes all models except the 70B model usable on high end consumer hardware. This makes all models except the 70B model usable on low to mid-range professional hardware.
2 big models in recend days:
Allow for allocating extra test-time compute:
Mainstream AI agent are expected to be one of the next big steps:
Sparks of Artificial General Intelligence: Early experiments with GPT-4
“The central claim of our work is that GPT-4 attains a form of general intelligence, indeed showing sparks of artificial general intelligence. This is demonstrated by its core mental capabilities (such as reasoning, creativity, and deduction), its range of topics on which it has gained expertise (such as literature, medicine, and coding), and the variety of tasks it is able to perform (e.g., playing games, using tools, explaining itself, …). A lot remains to be done to create a system that could qualify as a complete AGI. We conclude this paper by discussing several immediate next steps, regarding defining AGI itself, building some of missing components in LLMs for AGI, as well as gaining better understanding into the origin of the intelligence displayed by the recent LLMs.”
Next steps for LLMs