🍳 Appliance Guide

Every kitchen needs the right appliance for the job. Here's what the main AI models are good at — and where they fall short.

All the major models can do most things. The differences are at the margins — and the margins matter when you're using AI every day.

🟢

OpenAI

ChatGPT (GPT-4o)

Try it →

The household name. Versatile and widely integrated.

✅ Strengths

Broad knowledge
Code & debugging
Plugin/tool ecosystem
Image generation (DALL·E)
Voice mode

⚠️ Weaknesses

Can be sycophantic
Long context less reliable
Costs add up on heavy use

🟠

Anthropic

Claude (Sonnet / Opus)

Try it →

Best for long documents, nuanced writing, and following complex instructions.

✅ Strengths

200K+ token context
Nuanced, careful writing
Strong instruction-following
Less sycophantic
Document analysis

⚠️ Weaknesses

No image generation
Less plugin ecosystem
Can be cautious on edge cases

🔵

Google

Gemini (1.5 Pro)

Try it →

Google's model. Strong at search-grounded answers and multimodal tasks.

✅ Strengths

1M token context
Google Search grounding
Strong with data/spreadsheets
Multimodal (text, image, video, audio)
Free generous tier

⚠️ Weaknesses

Less consistent writing quality
Personality feels flatter
Newer ecosystem

🟣

Perplexity AI

Perplexity

Try it →

AI search engine. Best when you need current, cited information.

✅ Strengths

Real-time web search
Cites sources
Great for research
Fast answers with links
Good free tier

⚠️ Weaknesses

Not great for generation tasks
Less creative
Not for long documents

⚫

Open source

Mistral / Llama (Local)

Try it →

Run AI locally. Private, free, no subscription.

✅ Strengths

100% private
No cost after setup
No rate limits
Customisable
Offline capable

⚠️ Weaknesses

Requires setup (Ollama etc.)
Less capable than frontier models
No built-in web access

🌀

Google

Diffusion Gemma

Try it →

Generates entire text blocks and refines them in multiple passes using diffusion technology.

✅ Strengths

Up to 4× faster generation
Over 1,000 words per second on powerful hardware
Reads both sides of a sentence (bidirectional)
Can run locally on your computer
Fixes awkward text in the middle & fills gaps

⚠️ Weaknesses

Not designed for complex reasoning tasks
Requires local deployment setup for maximum speed

Not sure which to use?

The Test Kitchen lets you run the same prompt through multiple models and compare the outputs side-by-side.

Open the Test Kitchen →

Core

Collections

Ingredients

Before you cook

Set it and forget it

Stay safe

Compare

Test

See it in action

Gallery

Media

Learn from mistakes

Participate

Your kitchen

Support

🍳 Appliance Guide

ChatGPT (GPT-4o)

✅ Strengths

⚠️ Weaknesses

Claude (Sonnet / Opus)

✅ Strengths

⚠️ Weaknesses

Gemini (1.5 Pro)

✅ Strengths

⚠️ Weaknesses

Perplexity

✅ Strengths

⚠️ Weaknesses

Mistral / Llama (Local)

✅ Strengths

⚠️ Weaknesses

Diffusion Gemma

✅ Strengths

⚠️ Weaknesses

Not sure which to use?