01 · Artificial Intelligence

The
Thinking
Machine


Artificial intelligence has crossed a threshold. For decades it was a specialised discipline, confined to narrow applications — chess engines, spam filters, recommendation algorithms. Then, between 2017 and 2023, a series of architectural breakthroughs produced systems capable of generating coherent prose, realistic imagery, functional code, and nuanced conversation. The transformer architecture, introduced by Google researchers in the landmark paper Attention Is All You Need, proved to be the key that unlocked large-scale language modelling.

Today, large language models (LLMs) such as GPT-4, Claude, and Gemini are deployed across healthcare, law, education, and software engineering. They summarise legal documents, assist with diagnoses, tutor students in real time, and write production-ready code. The economic implications are profound: McKinsey estimates that generative AI could add between $2.6 trillion and $4.4 trillion annually to the global economy.

Yet the technology is not without tension. Questions of accuracy — LLMs hallucinate, producing confident falsehoods — remain unresolved. The energy cost of training frontier models is significant; GPT-4 is estimated to have required tens of millions of dollars of compute and substantial carbon expenditure. And the labour displacement implications are beginning to register across white-collar sectors that previously considered themselves automation-proof.

Machine vision has matured in parallel. Convolutional neural networks, refined through ImageNet competitions, now outperform human radiologists on specific diagnostic tasks. Autonomous driving systems from Waymo and Tesla log millions of miles, learning from edge cases that no human driver could accumulate. Industrial inspection, satellite image analysis, and real-time security surveillance have all been transformed.

The frontier today is reasoning. Systems like OpenAI's o1 model and DeepMind's AlphaProof demonstrate that AI can now construct formal mathematical proofs and perform multi-step logical reasoning that was considered beyond reach just years ago. Whether this constitutes genuine understanding or sophisticated pattern completion remains one of the deepest open questions in cognitive science.

What is certain is that the pace of progress has not slowed. Multimodal models now process text, images, audio, and video simultaneously. Agentic systems execute complex workflows across tools and APIs without human intervention. The question is no longer whether AI will reshape society — it already has — but how deliberately that reshaping will be guided.


Technology Comparison

AI Systems at a Glance

System Developer Primary Capability Parameters (est.) Release
GPT-4o OpenAI Multimodal language, vision, code generation >1 trillion 2024
Claude 3.5 Anthropic Reasoning, safety, long-context understanding Undisclosed 2024
Gemini Ultra Google DeepMind Scientific reasoning, multimodal, coding Undisclosed 2024
Llama 3.1 Meta AI Open-weights general purpose LLM 405 billion 2024
Mistral Large Mistral AI Efficient instruction-following, multilingual ~123 billion 2024
AlphaProof Google DeepMind Formal mathematical proof generation N/A (RL agent) 2024

Watch: The Age of AI — How It Changes Everything