ChatGPT Stock Picks vs. the S&P 500
Can you type "pick me 10 stocks" into an AI chatbot and beat the market? Dozens of experiments have tried it. Academic researchers have tested it. Someone even launched an autonomous GPT-4 trading agent with real money. This essay collects every credible test and tells you what actually happened.
Every Major Test, Ranked by Rigor
Not all AI stock-picking experiments are created equal. A TikToker asking ChatGPT for five stocks and checking back in a month is entertainment. A University of Chicago study testing GPT-4 across 150,000 firm-year observations with anonymized data is research. The results depend heavily on the methodology.
| Experiment | Model | Return | Benchmark | Alpha |
|---|---|---|---|---|
| UChicago (150K obs.) | GPT-4 CoT | +10% ann. | N/A* | +10% ann. |
| MarketSenseAI (multi-agent) | GPT-4o | +125.9% | +73.5% | +52.4% |
| ScienceDirect (anonymized) | Gemini 1.5 | Below index | S&P 500 | Negative |
| StockBench (real trading) | Multiple | Below B&H | Buy & hold | Negative |
| Finder.com (8 weeks) | GPT-3.5 | +4.9% | +3.0% | +1.9% |
| Motley Fool UK (9 months) | GPT-4 | +17.4% | +13.0% | +4.4% |
The pattern is inconsistent. Some experiments show strong outperformance. Others show underperformance. The most rigorous studies tend to find that LLMs struggle to beat simple buy-and-hold strategies in real-world trading conditions, even when they show promise in backtests.
AI Stock Pickers Love the Obvious Winners
When you ask ChatGPT, Claude, or Gemini to pick stocks, they tend to recommend the same companies: NVIDIA, Microsoft, Apple, Amazon, Alphabet. These are the most discussed companies in their training data. They're the companies with the most analyst coverage, the most earnings call transcripts, and the most bullish commentary.
In 2025, a Motley Fool writer compared ChatGPT's three picks (Microsoft, NVIDIA, Visa) against his own three picks (Amazon, Axon Enterprise, Uber). ChatGPT returned 17.4%. The human returned 23.1%. Both beat the S&P 500's 13%.
But ChatGPT's picks were the second and third largest companies in the index by market cap. You don't need artificial intelligence to suggest buying the biggest, most successful companies on Earth. That's the equivalent of asking a language model for restaurant recommendations and getting back "Try the most popular restaurant in the city."
The Tasks LLMs Are Actually Good At
The Tasks LLMs Get Wrong
How to Use AI for Stock Research Without Fooling Yourself
Use AI For
Screening stocks by quantitative criteria. Summarizing earnings calls and 10-K filings. Comparing companies within a sector on specific financial metrics. Generating investment theses you then verify with primary sources. Building multi-factor scoring models. Identifying data patterns in large datasets. AI works like a research analyst who never sleeps and reads everything but has no original insight about the future.
Don't Use AI For
Final buy/sell decisions. Timing entries and exits. Predicting macro events or regime changes. Replacing your own judgment on position sizing and risk management. Following "top 10 stock picks" from any chatbot without doing your own analysis. The most common mistake: treating an LLM's confident tone as evidence of accuracy. These models sound certain even when they're guessing.
How I Built This
Analysis based on published academic research, public experiment trackers, financial media reporting, and ETF performance data as of early 2026.