Monday, December 15, 2025

Claude vs GPT-5 vs Gemini: Stay Gold Buying and selling Experiment Week 1 – My Buying and selling – 7 October 2025


Auto-posted whereas I am in Tokyo. Working these checks 24/7 on VPS.

I have been operating the identical Gold buying and selling prompts by way of three totally different AI fashions for per week. Similar account, identical professional advisor (DoIt Alpha Pulse AI), fully totally different pondering patterns.

Here is what’s really occurring with Claude, GPT-5, and Gemini once they analyze Gold.

The Take a look at Setup (You Can Replicate This)

The Actual Immediate I am Utilizing

Present XAUUSD: [price] Final 3 H1 candles: [data] Session: [London/NY/Asian] Information right this moment: [economic calendar] Ought to I: Purchase/Promote/Maintain? Threat: 0.5% max Goal: Threat-reward 1:2 minimal Clarify reasoning in 50 phrases max.

Easy. Clear. Similar for all three fashions.

Testing Situations

  • Demo account: $5000
  • Every mannequin will get: $1500 allocation
  • Similar trades provided: All three see an identical setups
  • Choice tracked: Even once they say “Maintain”
  • Time recorded: Response velocity issues

Early Observations (Not Conclusions)

GPT-5: The Overthinker

Response time: 3-5 seconds

GPT-5 retains discovering patterns which may not exist. Yesterday it mentioned:

“The three-candle formation resembles the Could 2023 reversal sample mixed with present DXY weak point suggesting institutional accumulation nevertheless the quantity profile signifies…”

Drawback: By the point it finishes pondering, the entry is gone.

Attention-grabbing habits: It catches refined correlations. Observed that Gold was ignoring Greenback energy as a result of bond yields have been additionally rising. That is really subtle.

Present standing:

  • Indicators generated: 12
  • Trades taken: 4 (others too gradual)
  • Win charge: 50% (2 wins, 2 losses)
  • P&L: +45 pips

Claude Opus 4.1: The Velocity Dealer

Response time: 1-2 seconds

Claude makes choices FAST. Typically too quick. Its responses are like:

“Bullish. London open + assist held + Greenback weak. Purchase.”

Energy: In quick markets, Claude really will get fills. Throughout Wednesday’s volatility, it was the one mannequin that caught the reversal.

Weak point: Much less nuanced. Missed the Bond/Gold correlation fully.

Present standing:

  • Indicators generated: 18
  • Trades taken: 11
  • Win charge: 54% (6 wins, 5 losses)
  • P&L: +72 pips

Gemini 2.5: The Conservative One

Response time: 2-4 seconds (varies)

Gemini is extra cautious. Typically passes on trades the others take. Tuesday it mentioned:

“No clear edge. Recommend ready for higher setup.”

This occurs extra with Gemini than GPT or Claude.

Surprising energy: Threat administration. When unsure, it usually suggests smaller positions. The one mannequin that recurrently says “cut back danger to 0.25%” when confidence is decrease.

Minor weak point: Typically TOO conservative, lacking good strikes whereas ready for “excellent” setups.

Present standing:

  • Indicators generated: 9
  • Trades taken: 5
  • Win charge: 60% (3 wins, 2 losses)
  • P&L: +38 pips

The Attention-grabbing Discovery: They Typically Disagree

More often than not, they agree on path. However here is what occurred Thursday at London open:

Gold worth: 1952.30
Setup: Break above Asian excessive

  • GPT-5: “Look forward to pullback to 1950”
  • Claude: “Purchase now, momentum constructing”
  • Gemini: “Purchase however smaller place”

Similar bullish bias, totally different approaches to entry.

Claude entered instantly. Gold ran to 1958. Claude bought one of the best entry.
However all three would have been worthwhile – simply totally different quantities.

What’s Truly Useful Right here

Velocity vs Intelligence Commerce-off

  • Want quick choices? Claude
  • Want deep evaluation? GPT-5
  • Want danger administration? Gemini (surprisingly)

Price Per Choice (This Week)

  • GPT-5: $0.12 common
  • Claude: $0.08 common
  • Gemini: $0.06 common

Claude is 33% cheaper AND sooner. However GPT-5’s two wins have been greater (+40 and +35 pips vs Claude’s common of +20).

The “Confidence” Drawback

None of those fashions say “I do not know” sufficient. They at all times have an opinion, even once they should not.

I am testing including this to prompts:

If unclear, say "No edge - skip this setup"
Confidence required: 70% minimal 

Early outcomes: 40% fewer alerts, however higher win charge.

The Framework That is Rising

After one week, here is what I am studying:

Use Claude When:

  • Information is about to hit (velocity issues)
  • London/NY session opens (momentum trades)
  • You want fast choices on clear setups

Use GPT-5 When:

  • Asian session (extra time to suppose)
  • Advanced correlations matter
  • You may await excellent entries

Use Gemini When:

  • You desire a second opinion
  • Threat administration is precedence
  • Testing new methods (it is extra conservative)

What’s Truly Working Nicely

Easy Operations

One factor that stunned me – DoIt Alpha Pulse AI handles all three fashions with out points:

  • No API errors (correct error dealing with in-built)
  • No charge restrict issues (clever request administration)
  • Constant connections throughout all fashions

That is really our aggressive benefit. Whereas others wrestle with integration, we simply… commerce.

The Actual Variations Are Refined

The fashions are extra comparable than totally different. All of them:

  • Catch primary assist/resistance
  • Perceive pattern path
  • React to main information

The variations are in type, not substance:

  • Claude: Direct and quick
  • GPT-5: Detailed and considerate
  • Gemini: Cautious and measured

The “Clarification Tax”

Asking for reasoning provides:

  • 1-2 seconds to response time
  • 2x the token price
  • Typically overthinking easy setups

However it’s price it for studying what the AI “sees”

What I am Testing Subsequent Week

Experiment 1: Consensus Buying and selling

Solely take trades the place 2 of three fashions agree. Idea: Larger conviction setups.

Experiment 2: Time-Primarily based Rotation

  • Asian: Gemini (conservative for quiet markets)
  • London: Claude (velocity for breakouts)
  • NY: GPT-5 (complexity of US session)

Experiment 3: Specialised Prompts

As an alternative of 1 immediate for all, optimize for every mannequin’s strengths:

  • Claude: Brief, action-focused
  • GPT-5: Embody correlation evaluation
  • Gemini: Add danger parameters

The Trustworthy Actuality

After one week of parallel testing, the fashions carry out equally on Gold buying and selling.

All of them catch the apparent strikes. The variations are marginal – perhaps 5-10% efficiency variance. The ability is not selecting the “proper” AI – it is writing higher prompts.

That is why DoIt Alpha Pulse AI helps all of them. Not as a gimmick, however as a result of totally different market situations want various kinds of pondering.

Your Homework Whereas I am in Japan

If in case you have DoIt Alpha Pulse AI, do this:

  1. Run the identical setup by way of totally different fashions
  2. Doc once they disagree
  3. Observe which one was proper
  4. Share findings

By the point I am again, we’ll have crowd-sourced information on which mannequin works finest for what.

The Questions I am Investigating in Tokyo

Assembly with quant merchants right here who’ve been utilizing AI longer:

  1. How do they deal with mannequin disagreement?
  2. What’s their method to consensus?
  3. How do they optimize for latency from Asia?
  4. Are there fashions we’re not contemplating?

Present Scoreboard (Week 1)

Velocity Champion: Claude (1-2 seconds)
Accuracy Chief: Gemini (60% win charge however small pattern)
Complexity Grasp: GPT-5 (catches refined patterns)
Price Winner: Gemini ($0.06/choice)
Reliability: Claude (most constant)

However bear in mind – that is one week of knowledge. Not conclusions, simply observations.

The Actual Worth of This Experiment

It isn’t about discovering the “finest” mannequin. It is about understanding that AI buying and selling technique is not one-size-fits-all.

Your buying and selling type, the pairs you commerce, your danger tolerance – all of them have an effect on which AI mannequin fits you.

That is why the immediate is extra necessary than the mannequin. An important immediate on Claude beats a nasty immediate on GPT-5 each time.

Need to run your individual AI mannequin experiments?

Get DoIt Alpha Pulse AI – Now $397

Helps all main AI fashions. Swap between them immediately. Discover what works for YOUR buying and selling.

P.S. – Nonetheless in Tokyo. These fashions are operating 24/7 on my VPS. After I test in from my lodge, I see Claude and GPT-5 arguing about whether or not 1958 is resistance or assist. Even AIs cannot agree on primary TA.

P.P.S. – In the event you’re testing fashions your self, doc every little thing. The patterns solely emerge with information, not hunches.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles