Posts

Showing posts with the label Large Language Models

Why an LLM (Large Language Model) is Not Merely Predicting the Next Word

Image
  WHY IS AN LLM ABLE TO GENERATE OUTPUT THAT SEEMS TO BE MORE THAN JUST PREDICTING THE NEXT WORD?  (LLMs as Complex Adaptive Systems with properties of Emergence). If an LLM (Large Language Model) is all about predicting the next word via a probabilistic function, (as explained in all AI textbooks) how is it able to generate such creative, seemingly intelligent answers to our questions? My hypothesis is that LLMs are Complex Adaptive Systems (CAS) such as those found in Nature, the Physical Sciences, and Human Society. (weather, insect colonies, economies and financial markets, flocks of birds and shoals of fish as Superorganisms, stampedes, viruses, city traffic flows, fluid dynamics, all manner of networks with feedback loops).  One of the attributes of a CAS is that it exhibits the phenomenon of Emergence.  Emergence is when something complex and unexpected arises from the interactions of its individual components (Agents). Emergence happens when (1) there are int...