Why Trading Performance Metrics Lie (And Which Ones Actually Don't)
Annual returns look great until you understand what they hide. Here are the trading performance metrics that actually tell you whether a strategy is working.
The Number Everyone Shows You (And Why It's Wrong)
Every trading platform, fund, and strategy seller leads with the same number: annual return. "Generated 34% in 2025." "Beat the market by 12 points." It's the first metric, the biggest font, and the one that drives the most decisions.
It's also one of the least useful metrics for evaluating trading performance.
Annual return tells you the output, but nothing about the process, the risk taken, the smoothness of the path, or whether the performance will persist. Two strategies can have identical annual returns while being completely different in terms of how you'd actually experience trading them.
Here's a guide to the metrics that actually matter — and how to read them correctly.
What You Should Actually Care About
Risk-Adjusted Return (Sharpe Ratio)
The Sharpe ratio divides excess return by volatility. It answers the question: how much return are you getting per unit of risk taken?
Formula: (Portfolio Return − Risk-Free Rate) / Portfolio Standard Deviation
A Sharpe ratio of 1.0 means you're getting 1% of return for every 1% of annualized volatility. A ratio of 2.0 means you're getting twice the return per unit of risk. Generally:
- Below 0.5: Poor
- 0.5–1.0: Acceptable
- 1.0–1.5: Good
- Above 1.5: Strong (verify it's not overfitted)
- Above 2.0: Exceptional or suspicious
The Sharpe ratio is far more informative than raw return because it penalizes strategies that take excessive risk to generate those returns. A 30% return with a Sharpe of 0.3 is less impressive than a 15% return with a Sharpe of 1.5.
The limitation: Sharpe penalizes all volatility equally, including upside volatility. A strategy with huge winners and small losers will look worse on Sharpe than it actually is.
Sortino Ratio: The Better Version
The Sortino ratio is like Sharpe but only penalizes downside volatility — the bad kind. Upside volatility (big winning trades) doesn't hurt your Sortino score.
This is more aligned with what traders actually care about. You don't mind when your strategy makes more than expected. You mind when it loses more than expected.
For strategies with asymmetric return profiles (like options strategies or trend-following), the Sortino ratio is often more informative than Sharpe.
Maximum Drawdown
Maximum drawdown — the largest peak-to-trough decline over the measurement period — is the metric that tells you what you actually have to live through.
Here's why it matters more than most people realize: humans have a highly nonlinear relationship with drawdowns. A 10% drawdown is uncomfortable but manageable. A 20% drawdown causes many traders to question the strategy. A 30% drawdown causes many to abandon it entirely — often right before recovery.
This behavioral reality means that a strategy with a 30% maximum drawdown has effectively zero value for most traders, even if the backtest shows it recovers and performs well over time. If you won't stick with it through the drawdown, the long-run returns are irrelevant.
Evaluating maximum drawdown isn't just a risk assessment — it's a behavioral compatibility test.
Calmar Ratio: Return per Drawdown
The Calmar ratio divides annualized return by maximum drawdown. It answers: how much return are you getting per unit of worst-case drawdown experienced?
Formula: Annualized Return / Maximum Drawdown (absolute value)
A Calmar ratio of 1.0 means your annual return equals your maximum drawdown — for every 10% you made annually, you also experienced a 10% drawdown at some point.
A Calmar of 2.0 or higher is considered strong. Higher Calmar ratios indicate strategies that generate good returns with relatively contained drawdowns — the combination that makes strategies actually tradeable.
This is arguably the single most important ratio for practical trading evaluation.
Win Rate and Profit Factor
Win rate — the percentage of trades that are profitable — is nearly useless on its own. A win rate of 70% can mean a losing strategy. A win rate of 35% can mean a winning one.
What matters is the combination of win rate and payoff ratio (average win / average loss):
Expected Value = Win Rate × Average Win − (1 − Win Rate) × Average Loss
The metric that captures this cleanly is profit factor: total gross profit divided by total gross loss. A profit factor above 1.0 means the strategy is profitable in aggregate. Above 1.5 is good. Above 2.0 is strong.
The other useful metric here is expectancy: the average dollar profit or loss per trade. Positive expectancy, sized appropriately, compounds into good outcomes over time.
Recovery Time and Drawdown Duration
Often overlooked: how long does it take to recover from drawdowns? A strategy that experiences a 15% drawdown but recovers in 3 weeks is very different from one that takes 8 months to recover.
Long drawdown duration creates real-world problems: impatience, strategy abandonment, missed opportunities in other assets. A strategy with short recovery times is more practically tradeable even if the max drawdown depth is similar.
Track your average drawdown duration alongside maximum depth.
The Metrics That Tell You About Robustness
Beyond point-in-time performance metrics, you want to understand whether the performance is robust or fragile.
Out-of-sample performance: How does the strategy perform on data it wasn't trained on? A large gap between in-sample and out-of-sample performance indicates overfitting.
Parameter sensitivity: How much does performance change if you slightly adjust the strategy's parameters? Robust strategies perform similarly across a reasonable range of parameter values. Fragile strategies perform great at one specific parameter setting and poorly on either side.
Performance across market regimes: Does the strategy work in both bull and bear markets? In high-VIX and low-VIX environments? Strategies that only work in one specific regime are time-limited.
Trade frequency: A strategy with very few trades per year has high statistical uncertainty — you don't have enough data points to draw confident conclusions. More trades, within reason, give you more statistical confidence in the results.
The Number That Nobody Shows You
Here's the metric that marketing never shows but sophisticated investors always ask for: underwater curve — the percentage of time the strategy was in drawdown at any given point.
A strategy can have a modest maximum drawdown but be underwater 70% of the time. That means 70% of the time you're waiting for recovery, not enjoying new highs. For most people, this is psychologically unsustainable.
The underwater curve tells you what the day-to-day experience of trading a strategy actually looks like. It's a reality check that annual returns and Sharpe ratios don't provide.
How to Use These Metrics Together
No single metric is sufficient. Here's a practical evaluation framework:
- Start with Calmar ratio — is the return/drawdown tradeoff acceptable?
- Check Sharpe or Sortino — is the risk-adjusted return competitive?
- Look at maximum drawdown depth and duration — can you actually live through this?
- Check out-of-sample performance — is it real or overfitted?
- Examine performance by regime — will it work in different market environments?
If a strategy passes all five tests, it's worth serious consideration. Most strategies fail somewhere in this list.
See why consistency beats prediction in trading →
What Lukra Reports
Lukra publishes Sharpe, Calmar, maximum drawdown, and monthly performance breakdowns for each active strategy. Not because these numbers are perfect, but because they give you the full picture — not just the headline number that makes us look good.
If a strategy is struggling, that will show in the metrics. If it's performing well, the metrics will confirm it across multiple dimensions.
That's what transparency looks like in systematic trading.
Past performance is not indicative of future results. All trading involves risk of loss. This content is for educational purposes only.
Ready to see AI-driven trading in action?
View Live Performance