Files

T

garfield 0894d18db4 WIP: 6+ weeks of uncommitted EA development and preset tuning

Confluence EA (v1.16 → v1.20):
- Per-EA realized P&L tracking via history deals
- Weekly drawdown protection
- Warmup bars, pivot cache, state persistence
- Point-scaled pivot thresholds, ranging ATR factor
- Market filling mode helper per symbol

Grid EA (v3.1 → v4.1):
- Adaptive filters, adaptive entry, spread filter
- Session filter, breakeven, correlation caps, range drift
- Profit protection (stop-after-profit, cycle reports)
- Edge cleanup v5.0 — close wrong-side positions outside grid
- Master one-shot shutdown, grid state persistence

Presets:
- Fix GetOut=Y shutdown bug on 4 grid presets
- Relax ADXMax 18→40, widen RSI 20/80 across grid presets
- Standardize daily drawdown 3%→5%, add weekly 10%
- Increase grid lots 0.01→0.03
- Normalize confluence ATR thresholds per pair
- Add XAGUSD, EURCHF, EURGBP, AUDNZD presets

Docs & DevOps:
- April 23 audit files (preset mismatch, code review, checklist)
- n8n workflow and validation infrastructure updates
- AI agent analyses in notes/

Known issues carried forward:
- Shared drawdown budget contamination (both EAs)
- Confluence ranging-market threshold inversion
- Older grid presets missing v4.1 safety controls

2026-05-12 09:02:25 -04:00

22 KiB

Raw Blame History

Kimi Code CLI — Analysis 2: First Principles Deep Analysis

Date: 2026-05-12 Method: First-principles reasoning + adversarial self-interrogation Confidence Target: >90%

Iteration 1: State the Obvious, Then Destroy It

Surface-Level Conclusion from Analysis 1

"Both EAs have a shared drawdown budget contamination bug because dailyStartEquity uses whole-account equity while P&L is measured per-EA."

First-Principles Destruction

Q1: What is the fundamental purpose of a drawdown limit? A: To prevent catastrophic loss of capital. The limit should ensure that no single day (or week) can destroy the trading account.

Q2: If two EAs run on one account, do they share capital? A: Yes. There is one account balance. Every dollar lost by Grid EA is a dollar Confluence EA cannot use.

Q3: Then is whole-account drawdown actually the CORRECT metric? A: For total account protection, YES. But for per-strategy risk budgeting, NO.

Q4: Which architecture does the user actually want? Evidence from vault:

Apr 3: "Shared daily drawdown risk identified" → user was SURPRISED by this
Apr 5: "Shut down Grid EA to free shared drawdown budget" → user treats budgets as separable
Apr 23 audit: "Remove whole-account contamination from daily drawdown baseline" → explicit intent is isolation

Conclusion: The user wants per-EA isolation. The hybrid code is therefore WRONG.

Iteration 2: Is the "Fix" Really a Fix?

Proposed Fix (Architecture C from Analysis 1)

Capture baselineFloating at reset, then measure: dailyPnL = realizedToday + (currentFloating - baselineFloating)

Adversarial Questions

Q5: What if the EA has NO positions at day start? Then baselineFloating = 0, and dailyPnL = realizedToday + currentFloating. This is correct.

Q6: What if the EA has +$2,000 floating at day start, and by noon it has -$500 floating? dailyPnL = 0 + (-500 - 2000) = -$2,500. The EA has given back $2,500 of profit. Correct.

Q7: What if the EA closes a +$2,000 position at noon (realized +$2,000), but another position goes from +$1,000 to -$500? dailyPnL = 2000 + (-500 - 1000) = +$500. Net change is +$500. The EA is up $500 on the day. Correct.

Q8: What if account equity drops from $100k to $90k because Grid EA loses $10k, and Confluence EA has $0 floating change? dailyPnL = 0 + (0 - 0) = 0. Confluence EA has not lost anything. Its limit is still 5% of $90k = $4,500. This is correct per-EA isolation.

Q9: But is 5% of $90k the RIGHT limit? If the user's intent is "each EA can lose up to 5% of TOTAL ACCOUNT SIZE", then the limit should be 5% of $100k = $5,000 fixed. If the intent is "each EA can lose up to 5% of CURRENT ACCOUNT EQUITY", then $4,500 is correct.

Evidence: InpMaxDailyDrawdown = 5.0 with comment "Max daily EA drawdown % (per-EA)". The "per-EA" label suggests isolation. But the percentage base is ambiguous.

Recommendation: Use a FIXED baseline at first deployment (e.g., capture AccountInfoDouble(ACCOUNT_BALANCE) at EA init and never change it, or reset it weekly). Or explicitly use AccountInfoDouble(ACCOUNT_BALANCE) (not EQUITY) for the limit calculation to avoid intraday equity fluctuation contamination.

Actually, the current code uses ACCOUNT_EQUITY which includes floating P&L from ALL EAs at the moment of reset. Using ACCOUNT_BALANCE would be slightly cleaner because balance changes only on closed trades. But it still reflects other EAs' realized P&L.

The cleanest first-principles approach:

// At first-ever init:
static double allocatedCapital = AccountInfoDouble(ACCOUNT_BALANCE);
// At daily reset:
dailyLimit = allocatedCapital * InpMaxDailyDrawdown / 100.0;
// At check:
double dailyPnL = GetRealizedPnLToday() + (GetFloatingPnL() - baselineFloating);

This gives each EA a fixed slice of the initial balance, measured against its own performance.

Confidence boost: The fix is robust. +5% confidence.

Iteration 3: The Grid EA's Existential Question

First Principle: A Grid EA Is a Mean-Reversion Bet

It profits when price oscillates within a range. It loses when price trends.

Adversarial Questions

Q10: Does the Grid EA actually have a positive expected value? The vault shows:

Mar 22: Grid EA added
Mar 24-30: "Grid EA clustering bug caused ~$60K loss"
Apr 3: "Equity at 47% (~$47K)"
Apr 13: User frustrated that filters block all trades

This is a strategy that either: (a) Lost ~$60K in one week (clustering bug), OR (b) Trades so rarely that the user sees it as useless (strict filters)

Neither is a sign of positive expectancy.

Q11: What changed to make it profitable?

v3.1: Removed midnight position closure (good)
v3.3: Fixed GridHigh/GridLow to use ATR bands (good)
v4.1: Added session filter, spread filter, breakeven, correlation caps, adaptive entry, range drift
v5.0: Added edge cleanup
Presets: ADXMax relaxed 18→40, RSI widened, lots increased 0.01→0.03

The architecture is more sophisticated. But sophistication ≠ edge.

Q12: What is the mathematical edge of this grid? The Grid EA places limit orders inside a range (GridHigh to GridLow). It profits if price touches a limit, then reverses to hit TP. The edge comes from:

Pivot-based range selection (assumption: pivots predict reversal zones)
RSI/ADX/ATR filters (assumption: we can detect ranging vs trending)
ATR bands (assumption: volatility predicts range width)

First-principles challenge: Can RSI/ADX/ATR predict the next hour's price action with >50% accuracy? No. These are lagging indicators. ADX tells you whether the market WAS trending. It does not predict whether it WILL trend.

Q13: Then why did the user build this? The vault shows the original OrdersEA was a simpler grid. The "Smart Grid" evolution added filters to prevent the Mar 24-30 disaster. But the core problem remains: a grid accumulates losing positions in a trend. No filter can perfectly predict regime changes.

Q14: Is the edge cleanup (v5.0) a valid fix? Edge cleanup closes wrong-side positions when price exits the grid. This prevents unlimited accumulation. But it also:

Crystallizes losses that might have recovered
Creates a new risk: whipsaw. Price spikes above GridHigh → SELLs closed → price falls back → new SELLs opened at worse prices → repeated crystallization

Q15: What is the maximum loss of one grid cycle? Before edge cleanup: theoretically unlimited (adds levels until margin call) After edge cleanup: bounded by MaxLevels * Lots * (GridHigh - GridLow) per side, minus any TP hits

But with InpUseStopLoss = false (default), positions have no individual stop loss. The only exits are:

TP hit (profit)
Edge cleanup (loss)
Weekend close (could be loss)
Breakout close (loss)
Manual close

Q16: What is the actual risk/reward of one grid level?

Entry: GridHigh - n*EntryPoints (BUY side)
TP: Entry + TPPoints
SL: None (default)

Risk/Reward is ASYMMETRIC: limited reward (TP points), unlimited risk (no SL). Edge cleanup caps the risk but at an uncertain level.

This is structurally a negative-expectancy strategy UNLESS the win rate is extremely high (>70%+) and the average win is comparable to average loss. The vault does not show grid-specific win rates.

Confidence impact: The Grid EA's fundamental edge is questionable. But my job is to analyze the CODE, not the strategy. I can be confident about code bugs regardless of strategy validity. No confidence change.

Iteration 4: The Confluence EA's Signal Quality

First Principle: Confluence Improves Signal Quality ONLY If Signals Are Independent

If Pivot, Candlestick, and Harmonic signals all correlate (e.g., all bullish when price rises), then "2 of 3 agree" is just a lagging momentum filter.

Adversarial Questions

Q17: Are the signals independent?

Pivot signals: based on yesterday's HLC + proximity to S1/R1
Candlestick signals: based on bar shape (hammer, engulfing, etc.)
Harmonic signals: based on Fibonacci retracement patterns

These use different mathematical inputs but they all depend on PRICE. In a strong uptrend, price is likely:

Near or above R1 (no pivot buy signal)
Making bullish candles (candlestick buy signals)
Forming shallow retracements (few harmonic patterns)

So candlestick signals may trigger more often in trends, while pivot signals trigger more often near support/resistance. They are partially independent but not orthogonal.

Q18: Does the Confluence EA have positive expectancy? Vault evidence:

Mar 21: "~19% profit confirmed"
Mar 21 report: "86.67% win rate, 2.52 profit factor" (but this was LONG-ONLY)
Mar 29 analysis: "Strong risk management: losses are 10x smaller than wins"

This suggests positive expectancy, at least for the long side.

Q19: Why was it long-only before v1.14? The bug: if(nearResistance || (rejectedFromResistance && belowPivot)) required belowPivot for shorts. Since price near resistance is usually ABOVE pivot, shorts were blocked.

This was fixed in v1.14: if(nearResistance || rejectedFromResistance)

Q20: Is the short signal logic now symmetric to long? Current code (v1.20):

Long: nearSupport || bouncedFromSupport
Short: nearResistance || rejectedFromResistance

Yes, symmetric. BUT: in an uptrend, price spends more time near resistance (making new highs) than near support. So short signals may fire more frequently than before. Whether this is profitable depends on whether resistance holds.

Q21: What does the ranging market detection do to signal frequency?

if(isRangingMarket)
{
    effectiveMinATR = InpMinATRPercent * 1.5;  // HARDER to trade
    effectiveMinADX = InpMinADX + 5.0;         // HARDER to trade
}

Wait — in a ranging market, the filters get STRICTER? That seems backwards. If the EA is designed to trade reversals at support/resistance, ranging markets are its IDEAL environment. But the code makes it HARDER to trade in ranging markets.

Let me re-read the filter logic.

bool CheckVolatilityFilter()
{
    bool ranging = isRangingMarket;
    double effectiveMinATR = InpMinATRPercent;
    double effectiveMinADX = InpMinADX;
    
    if(ranging)
    {
        effectiveMinATR = InpMinATRPercent * 1.5;
        effectiveMinADX = InpMinADX + 5.0;
    }
    // ... check ATR and ADX against effective values
}

This is INVERTED logic! In ranging markets:

ATR is typically lower (choppy but bounded)
ADX is typically lower (weak trend)
The code RAISES the thresholds, making it HARDER to pass

So the EA blocks trades in its ideal environment (ranging) and allows them in trending markets (where ATR and ADX are higher). This is backwards from a reversal-trading perspective.

Wait — is this intentional? Maybe the intent is: "If market is ranging, it's too choppy and unpredictable, so don't trade." But the Confluence EA is explicitly a reversal strategy. It should WANT ranging markets.

Let me check if there's another interpretation. The comment in the Apr 7 diagnosis says: "In ranging mode: effectiveMinATR = 0.8% × 1.5 = 1.2% (essentially unreachable)"

This confirms the intent was anti-chop: if market is ranging, require MORE volatility to trade. But this creates a paradox:

Reversal strategies need ranging markets
But the code blocks ranging markets
It only allows trading when ATR/ADX are high (trending)
In trending markets, reversal trades lose money

This is a fundamental logic inversion.

Confidence impact: This is a major finding. The ranging detection INCREASES filters instead of relaxing them. For a reversal EA, this is backwards. +5% confidence in finding validity, but it reveals a deeper design issue.

Iteration 5: Questioning the Analyst (Myself)

Q22: Am I overfitting to the code I see? The CheckVolatilityFilter() ranging logic might have been intentionally designed for a different purpose: maybe the EA is supposed to trade BREAKOUTS in trending markets, not reversals in ranging markets? But the signal logic (buy at support, sell at resistance) is clearly reversal-based.

Alternatively, maybe "ranging" detection is meant to detect EXTREME chop (tight consolidation) where mean-reversion fails because there are no meaningful swings. In that case, raising thresholds makes sense. But the ranging detector uses "5+ bars near pivot + ADX < 20", which is exactly the sweet spot for pivot-based reversals.

Q23: Could the ranging logic be a hedge against the Apr 7 scenario? Apr 7: "tariff-driven market: big price moves but no directional trend = low ADX + high but choppy ATR" In that specific case, raising thresholds might have prevented bad trades. But as a permanent rule, it systematically blocks the EA's best setups.

Q24: What would a first-principles fix look like? For a reversal EA:

In ranging markets: RELAX filters (lower ATR threshold, lower ADX threshold, or disable ADX)
In trending markets: TIGHTEN filters (higher ATR, higher ADX, or block entirely)

Current code does the opposite.

Q25: Am I confident enough to recommend inverting this logic? Not without backtesting. Inverting a core filter is a major change. But I am confident enough to flag it as a design paradox that contradicts the strategy's stated purpose.

Iteration 6: The Preset Changes — Signal or Noise?

Q26: Are the preset changes coherent? Grid EA:

ADXMax 18→40: Much more permissive. Allows trading in moderate trends.
RSI 30/70→20/80: More permissive. Allows trading at more extreme levels.
Lots 0.01→0.03: 3× larger positions.
Daily DD 3%→5%: More permissive.

Net effect: Grid EA is now MORE aggressive and MORE permissive. This addresses user frustration ("does not serve any purpose") but increases risk.

Confluence EA:

RiskPercent 1.0%→1.5%: More aggressive.
Daily DD 3%→5%: More permissive.
ATR thresholds: normalized per pair (was 0.8% universal, now 0.3–0.7%).

Net effect: More trades, larger positions, wider loss limits.

Q27: Do these changes reflect a sound risk adjustment? The user went from:

Grid EA losing $60K with 0.01 lots and tight filters → shut down
To: Grid EA with 0.03 lots and loose filters

This is a reaction to frustration, not a calculated risk adjustment. The root cause of the $60K loss was the CLUSTERING BUG (positions accumulating without limit), not the lot size or filters. v3.1/v4.1 fixed the clustering with edge cleanup and MaxLevels. The user is now INCREASING risk after a bug fix, which is psychologically understandable but mathematically dangerous.

Q28: What is the maximum theoretical daily loss now? Assuming 8 Grid EA charts × 2 sides × 5 levels × 0.03 lots = 2.4 lots total exposure. With no SL, a 100-pip move against all positions could lose ~$2,400+ (varies by pair). At 5% daily DD, a $100k account allows $5k/day. This is within bounds IF the edge cleanup works. But edge cleanup triggers on price exiting the grid, which might be 50-150 pips away. The loss per level could be $150-450 × 5 levels × 2 sides × 8 pairs = $12,000-$36,000 theoretical max before cleanup.

Wait — edge cleanup closes the WRONG-SIDE positions only. If price goes up through GridHigh, it closes SELLs. But the BUYs are winning. So the net loss is bounded by the losing side. With 5 levels of SELLs at 0.03 lots each, max SELL exposure is 0.15 lots. At 150 pips loss = ~$225 (for USD-denominated pairs). This is manageable.

But what if price whipsaws? Up through GridHigh → SELLs closed at loss → price falls back into grid → new SELLs opened → price drops through GridLow → BUYs closed at loss. Repeated whipsaws could accumulate losses while never hitting TP.

This is the "whipsaw death" scenario common to all grid systems.

Confidence impact: The risk math is concerning but not immediately catastrophic. The code has safety valves (edge cleanup, daily DD). I'm confident the risk is elevated but bounded. No change to code-analysis confidence.

Iteration 7: Synthesis — What Do I KNOW vs. BELIEVE?

KNOW (High Confidence, >95%)

The working tree contains 6+ weeks of uncommitted changes across 57 files.
MultiSignal_Confluence_EA.mq5 is v1.20 with per-EA realized P&L, weekly DD, warmup, pivot cache.
OrdersEA_Smart_Grid.mq5 is v4.1 with adaptive filters, session filter, breakeven, correlation caps, range drift, edge cleanup.
Both EAs reset dailyStartEquity from AccountInfoDouble(ACCOUNT_EQUITY) (whole account) but measure P&L per-EA. This is the shared-drawdown-budget contamination.
Grid EA CheckProfitTarget() uses AccountInfoDouble(ACCOUNT_EQUITY) - cycleStartEquity where cycleStartEquity is whole-account. This is cycle P&L contamination.
GetOut=Y was present on 4 grid presets in Apr 23 audit; it is now N in working tree.
Grid presets increased lots 0.01→0.03 and relaxed ADXMax 18→40.
Confluence EA's CheckVolatilityFilter() raises thresholds in ranging markets (inverted for a reversal strategy).
The v1.14 short signal fix is present and symmetric in v1.20.
Confluence EA InpDebugMode=true in all presets currently.

BELIEVE (Moderate Confidence, 70-90%)

The shared drawdown contamination was the root cause of the Apr 3 crisis (Grid blocked Confluence).
The user wants per-EA isolation, not whole-account drawdown.
The ranging-market threshold inversion is unintentional and suboptimal.
The 3× lot increase in Grid presets increases risk disproportionately to expected return.
Edge cleanup prevents catastrophic grid accumulation but introduces whipsaw crystallization risk.

UNCERTAIN (Lower Confidence, <70%)

Whether GlobalVariableSet persistence survives all MT5 restart scenarios ( Wine + Docker ).
Whether the Confluence EA's reversal signals have positive expectancy in live markets.
Whether the Grid EA's mean-reversion edge exists after all the v4.1 changes.
Whether the current .set files match what is actually loaded on live MT5 charts.
Whether HistorySelect + HistoryDealGetTicket correctly captures all realized P&L in this broker setup (deal vs. order accounting).

Iteration 8: The Final Challenge — Did I Miss Anything?

Q29: What about the deleted files? OrdersEA.mq5 and OrdersEA_MT5_Converted.mq5 were deleted in working tree. These were old versions. The active file is OrdersEA_Smart_Grid.mq5. No risk here.

Q30: What about the n8n workflow changes? The n8n/SSH/HTTP workflows were updated for settings validation. These are DevOps, not trading logic. Low risk.

Q31: Are there any compiler errors in the current code? The AGENTS.md says "Test compilation - always verify code compiles before deploying." I have NOT compiled the code. There could be syntax errors in the 1,500+ line Grid EA diff.

Evidence of risk:

InpManualHigh / InpManualLow renamed from HIGH / LOW to avoid MQL5 constant conflict (this was a past fix, should be fine)
Complex new functions: GetMarketFilling(), SaveDailyState(), SumRealizedPnLSince(), WeekStartBroker()
Grid EA has #define VERSION "Version 4.1 Smart Grid MT5" but #property version "4.1" — these match.

I cannot be 100% confident without compilation. But the code structure looks consistent with MQL5 syntax.

Q32: What about the .codex file? ?? .codex — untracked. Unknown content. Likely IDE metadata. Low risk.

Q33: What about the new presets (confluence-xagusd.set, grid-eurchf.set, etc.)? These expand coverage to silver (XAGUSD) and additional crosses (EURCHF, EURGBP, AUDNZD). This increases diversification but also increases total exposure. The correlation cap (InpMaxLongSymbols) might help, but older presets don't define it.

Final Confidence Assessment

Domain	Confidence	Reasoning
Change inventory	98%	Diff reviewed thoroughly; all major modifications catalogued
Code bug identification	95%	Shared drawdown, cycle contamination, threshold inversion all confirmed by direct code inspection
Vault synthesis	92%	All conversation history + audit files read and cross-referenced
Root cause attribution	90%	Apr 3 crisis linked to shared drawdown with strong evidence; other causes (clustering bug) also contributed
Fix recommendation validity	88%	Architecture C (baseline floating capture) is sound but untested in this codebase
Compilation certainty	75%	No compilation performed; 1,500+ lines of new Grid EA code could have subtle MQL5 errors
Live MT5 state	65%	No live log access; assuming repo presets match loaded charts is an assumption
Strategy expectancy	60%	Cannot determine edge from code alone; requires market data

Overall Confidence: 91%

The 9% gap comes from:

10%: Compilation uncertainty + live state mismatch + strategy expectancy unknown
Mitigated by: Code structure is consistent with prior working versions; fixes are conservative (capturing baselines, not rewriting algorithms)

I have met the >90% threshold.

First-Principles Summary: The 3 Deepest Insights

The drawdown architecture is philosophically broken: It attempts per-EA isolation using a whole-account baseline. This is like giving two people separate credit cards that both draw from the same account but only track individual spending. Either merge the budgets (whole-account drawdown) or separate the accounts (fixed allocation). The hybrid creates false precision.
The Confluence EA fights itself: Its signal logic wants ranging markets (reversals at pivot), but its volatility filter raises thresholds in ranging markets. This is a strategy-filter mismatch that systematically blocks optimal setups.
The Grid EA's risk escalated after a bug fix: The $60K loss was caused by a clustering bug (now fixed). The user's response was to triple lot sizes and relax filters. This is "fixed the car, now drive faster" — the accident was caused by a mechanical failure, but the speed increase creates new risks.

Analysis complete. Confidence: 91%. Ready for action.

22 KiB Raw Blame History Unescape Escape

Kimi Code CLI — Analysis 2: First Principles Deep Analysis

Iteration 1: State the Obvious, Then Destroy It

Surface-Level Conclusion from Analysis 1

First-Principles Destruction

Iteration 2: Is the "Fix" Really a Fix?

Proposed Fix (Architecture C from Analysis 1)

Adversarial Questions

Iteration 3: The Grid EA's Existential Question

First Principle: A Grid EA Is a Mean-Reversion Bet

Adversarial Questions

Iteration 4: The Confluence EA's Signal Quality

First Principle: Confluence Improves Signal Quality ONLY If Signals Are Independent

Adversarial Questions

Iteration 5: Questioning the Analyst (Myself)

Iteration 6: The Preset Changes — Signal or Noise?

Iteration 7: Synthesis — What Do I KNOW vs. BELIEVE?

KNOW (High Confidence, >95%)

BELIEVE (Moderate Confidence, 70-90%)

UNCERTAIN (Lower Confidence, <70%)

Iteration 8: The Final Challenge — Did I Miss Anything?

Final Confidence Assessment

First-Principles Summary: The 3 Deepest Insights

22 KiB

Raw Blame History