AI tools personal benchmark
+ which tool to use for what
---
(list for future ref - not a ranking)
● chatgpt
● claude
● grok
● perplexity
● gemini (google)
● mistral
🟦 1. ChatGPT (OpenAI)
🔹 Best for: Versatile conversation, creativity, and coding.
🔹 Strengths:
Strong at writing, brainstorming, and problem-solving.
Solid coding assistant with explanations.
Good memory (if enabled).
Access to plugins and tools (varies by version).
🟦 2. Claude (Anthropic)
🔹 Best for: Long documents, reasoning, and structured analysis.
🔹 Strengths:
Handles long context well.
More cautious, ethical responses.
Good for business and legal-style writing.
🟦 3. Grok (xAI)
🔹 Best for: Edgy, real-time insights (integrated with X/Twitter).
🔹 Strengths:
Supposedly "witty" and uncensored compared to others.
Designed for real-time news & trending topics.
Integrated into Elon Musk's ecosystem (X, Tesla, etc.).
🟦 4. Perplexity AI
🔹 Best for: Search, research, citations.
🔹 Strengths:
Focused on retrieval-augmented generation (RAG).
Provides sources for claims.
Good for fact-checking and live information.
🟦 5. Gemini (Google DeepMind)
🔹 Best for: Multimodal AI (text, images, code).
🔹 Strengths:
Strong on images + video + text processing.
Ties into Google’s ecosystem (Docs, Search, YouTube, etc.).
Good for summarizing web pages.
🟦 6. Mistral
🔹 Best for: Open-source, local AI models.
🔹 Strengths:
Focused on open-source AI.
Competes with LLaMA models (Meta).
Good for custom AI deployments.
AI Model | 💻 Coding | ✍️ Writing | 🧠 Thinking | 💬 Daily Convos
ChatGPT 🤖 | ★★★★☆ | ★★★★★ | ★★★★☆ | ★★★★★
Claude 🦙 | ★★★☆☆ | ★★★★☆ | ★★★★★ | ★★★★☆
Grok 🚀 | ★★☆☆☆ | ★★★☆☆ | ★★★☆☆ | ★★★★☆
Perplexity 🔎 | ★★☆☆☆ | ★★★☆☆ | ★★★★☆ | ★★☆☆☆
Gemini 🌐 | ★★★★☆ | ★★★★☆ | ★★★★☆ | ★★★★☆
Mistral 🏴‍☠️ | ★★★★☆ | ★★★☆☆ | ★★★☆☆ | ★★★☆☆
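Quick sketch for the "which tool for what" lookup, assuming the star ratings above are simply counted as numbers out of 5. The names RATINGS, best_for and stars are made up for this note (not from any library), and the scores are only my personal ratings copied from the table.

```python
# Personal star ratings copied from the table above (filled stars out of 5).
# RATINGS, best_for and stars are hypothetical names used only in this note.
RATINGS = {
    "ChatGPT":    {"coding": 4, "writing": 5, "thinking": 4, "daily_convos": 5},
    "Claude":     {"coding": 3, "writing": 4, "thinking": 5, "daily_convos": 4},
    "Grok":       {"coding": 2, "writing": 3, "thinking": 3, "daily_convos": 4},
    "Perplexity": {"coding": 2, "writing": 3, "thinking": 4, "daily_convos": 2},
    "Gemini":     {"coding": 4, "writing": 4, "thinking": 4, "daily_convos": 4},
    "Mistral":    {"coding": 4, "writing": 3, "thinking": 3, "daily_convos": 3},
}


def best_for(category):
    """Return every tool tied for the top rating in a category, in table order."""
    top = max(scores[category] for scores in RATINGS.values())
    return [tool for tool, scores in RATINGS.items() if scores[category] == top]


def stars(n, out_of=5):
    """Render a numeric rating back into the star notation used in these notes."""
    return "★" * n + "☆" * (out_of - n)


if __name__ == "__main__":
    for category in ("coding", "writing", "thinking", "daily_convos"):
        print(category, "->", ", ".join(best_for(category)))
```

Ties are kept on purpose (several tools share the top coding score), since the list is not meant as a ranking.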
● ● ●
=> ChatGPT 🤖
💻 Coding: ★★★★☆
--> Great for debugging, explanations, and full script generation.
✍️ Writing: ★★★★★
--> One of the best for storytelling, essays, and creative writing.
🧠 Thinking: ★★★★☆
--> Logical and structured, but not always the best at real-time updates.
💬 Daily Convos: ★★★★★
--> Super smooth, feels natural and engaging.
=> Claude 🦙
💻 Coding: ★★★☆☆
--> Decent, but not as advanced as ChatGPT for technical help.
✍️ Writing: ★★★★☆
--> Fantastic for long-form and structured documents.
🧠 Thinking: ★★★★★
--> One of the best for reasoning, analysis, and ethical considerations.
💬 Daily Convos: ★★★★☆
--> Feels formal but still friendly and insightful.
=> Grok 🚀
💻 Coding: ★★☆☆☆
--> Basic, not its main focus.
✍️ Writing: ★★★☆☆
--> Edgy and casual, but not deep or structured.
🧠 Thinking: ★★★☆☆
--> Decent, but prioritizes humor and bold takes.
💬 Daily Convos: ★★★★☆
--> Fun and witty, great for casual back-and-forth.
=> Perplexity 🔎
💻 Coding: ★★☆☆☆
--> Weak, mostly good for finding coding resources rather than generating.
✍️ Writing: ★★★☆☆
--> Clear and factual, but lacks creativity.
🧠 Thinking: ★★★★☆
--> Strong at fact-checking and sourcing information.
💬 Daily Convos: ★★☆☆☆
--> Feels like talking to a search engine rather than a person.
=> Gemini 🌐
💻 Coding: ★★★★☆
--> Advanced, almost on par with ChatGPT.
✍️ Writing: ★★★★☆
--> Good, but leans towards concise and structured over creative.
🧠 Thinking: ★★★★☆
--> Well-rounded, uses Google Search for more accurate real-world data.
💬 Daily Convos: ★★★★☆
--> Feels polished, but sometimes a bit scripted.
=> Mistral 🏴‍☠️
💻 Coding: ★★★★☆
--> Good, especially for self-hosted AI and open-source models.
✍️ Writing: ★★★☆☆
--> Basic, better for short, to-the-point responses.
🧠 Thinking: ★★★☆☆
--> Decent, but lacks the advanced reasoning of proprietary models.
💬 Daily Convos: ★★★☆☆
--> Functional, but less engaging than ChatGPT or Claude.
------------
📄 There are many more useful AI tools for music, image, video, and code generation. To be explored here in the future...