
The first is pre-training. All the language, videos, writings, and books of humanity are poured into a massive statistical blender. Within this blender, the sequence, origin, and context of knowledge are stripped away. The AI is then trained to look at enormous stretches of text and predict the next word. It becomes extraordinarily skilled at this — it can adopt any role, any persona, any mode of interaction. But it does not know how it arrived at any particular judgment. It is a mystery even to itself. Like observing a single grain of sugar in a glass of mixed fruit juice: did it come from the pineapple or the apple? Nobody knows.