AI Beginner Crash Course
Cut through the buzzword confusion. Understand what all these names, tools, and concepts actually are — and how they relate to each other.
Who Builds the Models
The companies at the top of the pyramid. They build the engines — everything else is built on top.
What AI Can Do
The types of content AI can work with. "Multimodal" means it handles more than one.
The Models
The actual AI. The brain. Everything else is an interface sitting on top.
| Company | Model | Input | Output |
|---|---|---|---|
| Anthropic | Claude Opus 4.5 | Text, Images, PDFs | Text |
| | Claude Sonnet 4.5 | Text, Images, PDFs | Text |
| | Claude Haiku 4.5 | Text, Images, PDFs | Text |
| OpenAI | GPT-5.2 | Text, Images, Audio | Text, Audio |
| | GPT-5.2-Codex | Text, Images | Text |
| | DALL-E 3 | Text | Images |
| | Sora 2 | Text, Images | Video |
| Google | Gemini 3 Pro | Text, Images, Audio, Video | Text |
| | Gemini 3 Flash | Text, Images, Audio, Video | Text |
| | Imagen 3 | Text | Images |
| | Veo 2 | Text, Images | Video |
| xAI | Grok 4.1 | Text, Images | Text |
| | Aurora | Text | Images |
| Meta | Llama 4 Scout | Text, Images | Text |
| | Llama 4 Maverick | Text, Images | Text |
| DeepSeek | DeepSeek-V3.2 | Text | Text |
| | DeepSeek-R1 | Text | Text |
| Alibaba Cloud | Qwen3-235B | Text | Text |
| | Qwen3-Max | Text | Text |
| | Qwen-Image | Text | Images |
| Mistral | Mistral Large 3 | Text | Text |
| | Devstral 2 | Text | Text |
| Cohere | Command A | Text | Text |
| | Command A Vision | Text, Images | Text |
| AI21 Labs | Jamba Large | Text | Text |
| | Jamba2 Mini | Text | Text |
| Stability AI | Stable Diffusion 3.5 | Text, Images | Images |
| | Stable Video 4D 2.0 | Text, Images | Video |
| | Stable Audio 2.5 | Text | Audio |
| Midjourney | Midjourney V7 | Text | Images, Video |
| Runway | Gen-4.5 | Text, Images | Video |
| Black Forest Labs | Flux 2 | Text, Images | Images |
Where AI Runs
Cloud, local, or a mix of both.
Chat Interfaces
The apps you talk to. They're wrappers — not the AI itself.
Running AI Locally
Your machine, your models. No internet connection needed once a model is downloaded, no subscriptions, and your prompts never leave your computer.
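To make "your machine, your models" concrete, here is a minimal sketch of talking to a locally running model from Python. It assumes Ollama is installed and serving the `llama3` model on its default port (the `ollama run llama3` command in this section starts one). The helper names `build_generate_request` and `ask_local_model` are made up for this illustration.

```python
import json
import urllib.request

# Ollama's default local address; nothing here goes out to the internet.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str, model: str = "llama3") -> urllib.request.Request:
    """Package a prompt as a POST request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_local_model(prompt: str, model: str = "llama3") -> str:
    """Send the prompt and return the generated text (requires a running server)."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `ask_local_model("Explain what an API is in one sentence.")` returns the model's reply as a string. The request only ever targets `localhost`, which is the whole point of running locally.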
Run `ollama run llama3` and go.
How It All Fits Together
The model is simple. Everything else is layers on top.
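One way to picture the layering is as a toy sketch in which the model is a plain function (text in, text out, no memory) and a chat interface is a wrapper that remembers the conversation and re-sends it each turn. Both `toy_model` and `ChatInterface` are hypothetical stand-ins invented for this illustration, not any real product's code.

```python
from typing import Callable, List


def toy_model(prompt: str) -> str:
    """Stand-in for a real model: text in, text out, no memory of earlier calls."""
    return f"(reply to {len(prompt)} characters of prompt)"


class ChatInterface:
    """A layer on top of the model: it keeps the conversation history."""

    def __init__(self, model: Callable[[str], str]) -> None:
        self.model = model
        self.history: List[str] = []

    def send(self, user_message: str) -> str:
        # The wrapper, not the model, remembers earlier turns.
        self.history.append(f"User: {user_message}")
        prompt = "\n".join(self.history)  # the whole conversation is re-sent each turn
        reply = self.model(prompt)
        self.history.append(f"Assistant: {reply}")
        return reply


chat = ChatInterface(toy_model)
chat.send("Hello")
chat.send("What did I just say?")
print(len(chat.history))  # 4
```

Swapping `toy_model` for a function that calls a real model changes nothing in `ChatInterface`, which is why the same model can sit behind many different apps.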
AI Coding Agents
From chat to coworker. They live inside your project — reading, editing, running, iterating.
| Agent | Terminal (CLI) | VS Code | Desktop App | Web Interface |
|---|---|---|---|---|
| Claude Code | ✓ | ✓ | ✓ (Claude Desktop) | ✓ (claude.ai/code) |
| Codex | ✓ | ✓ | ✓ (Standalone app) | ✓ (chatgpt.com/codex) |
| Gemini CLI | ✓ | ✓ (Companion ext.) | — | — |
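The "reading, editing, running, iterating" cycle can be pictured as a loop. The sketch below is a deliberately tiny toy, not how any of the agents above is actually implemented: the "project" is a dictionary of file contents, and `run_checks` and `stub_model` are hypothetical stand-ins for a real test runner and a real model.

```python
from typing import Callable, Dict, Tuple

Project = Dict[str, str]  # filename -> file contents


def agent_loop(
    project: Project,
    run_checks: Callable[[Project], Tuple[bool, str]],
    propose_edit: Callable[[str], Tuple[str, str]],
    max_rounds: int = 3,
) -> int:
    """Return how many edits were applied before checks passed, or -1 if they never did."""
    for edits_applied in range(max_rounds + 1):
        passed, report = run_checks(project)        # run
        if passed:
            return edits_applied
        if edits_applied == max_rounds:
            break                                   # give up after max_rounds edits
        filename, new_text = propose_edit(report)   # read the report, decide on a change
        project[filename] = new_text                # edit, then iterate
    return -1


def run_checks(files: Project) -> Tuple[bool, str]:
    """Toy check: the greeting file must contain exactly 'hello'."""
    if files["greeting.txt"] == "hello":
        return True, ""
    return False, "greeting.txt should contain 'hello'"


def stub_model(report: str) -> Tuple[str, str]:
    """Stand-in for a model call: always proposes the same fix."""
    return "greeting.txt", "hello"


project = {"greeting.txt": "helo"}  # a one-file "project" with a typo
edits = agent_loop(project, run_checks, stub_model)
print(edits, project)  # 1 {'greeting.txt': 'hello'}
```

A real agent differs mainly in scale: the checks are your test suite or build, and the proposed edits come from a model that has read your files.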
The Developer Toolkit
Not AI products — the environment you work in. The workbench, the tools, the infrastructure.
Git & GitHub
Version control and the connective tissue of modern software development.
Database & Backend Services
Where your app stores data. Backend-as-a-Service platforms that bundle database, auth, storage, and APIs.
APIs & API Keys
How software talks to software.
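As a rough sketch of what "software talking to software" looks like, the snippet below prepares an HTTP request that carries an API key. The URL `https://api.example.com/v1/generate`, the environment variable `EXAMPLE_API_KEY`, and the `{"prompt": ...}` body are all placeholders invented for this example. Real providers document their own endpoints, header names, and body formats.

```python
import json
import os
import urllib.request

API_URL = "https://api.example.com/v1/generate"  # hypothetical endpoint


def build_api_request(prompt: str) -> urllib.request.Request:
    """Build a POST request that identifies the caller with an API key."""
    # Read the key from the environment so it never gets pasted into source code.
    api_key = os.environ.get("EXAMPLE_API_KEY")
    if not api_key:
        raise RuntimeError("Set the EXAMPLE_API_KEY environment variable first")
    return urllib.request.Request(
        API_URL,
        data=json.dumps({"prompt": prompt}).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # a common way to send a key
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The key works like a password for your account: anyone who has it can make requests billed to you, so keep it in an environment variable or a secrets manager rather than in files you commit to Git.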
Hosting & Deployment
Putting it on the internet.