How we evaluate crypto portfolio strategies
Guardrails v1 — Risk-First Portfolio Control
Summary of core risk metrics (Max Drawdown, ES, Sharpe/Sortino) across rolling out-of-sample windows. Includes stress injections (gaps, liquidity crunch) to test envelope bounds.
Download leaderboard (CSV) →
Figures →
Baselines & Comparators
Reference implementations for BTC buy-and-hold, equal-weight top-N, and a public risk-parity proxy. These provide context for drawdown, volatility, and turnover versus our guardrailed approach.
How to Read the Leaderboards
Key metrics
- AR / AV: Annualized return and volatility
- Sharpe / Sortino: Risk-adjusted performance (rf≈0)
- Max Drawdown / ES(95): Tail risk and worst-case paths
- Turnover / Shortfall: Implementation realism under costs and slippage
Slices & windows
We report rolling out-of-sample windows (e.g., 2019–2021, 2021–2023, 2024-2025) to avoid look-ahead. Each slice includes costs, exchange outages, funding, and depeg filters where applicable.
Methods at a Glance
What we disclose
Mechanisms and evaluation protocol: universe eligibility, rebalance cadence, guardrail triggers (vol targeting, drawdown circuits), stress scenarios, and metric definitions.
What we withhold
Thresholds, hyperparameters, feature recipes, and model weights remain confidential. Benchmarks focus on outcomes and guardrail effectiveness.
Reproduce the Results
One-command run
Each suite provides a short repro.sh (or notebook) that regenerates figures from a tiny sample dataset. Full datasets are hosted via DOI on Zenodo.
Steps:
- Create environment (conda or pip)
- Run repro.sh to render figures into /figures
- Compare your outputs with our published CSV and images
Datasets & Code
Datasets
- Public Eval Dataset — versioned, DOI-archived; schema and checksums provided.
Code
Evaluation & Baselines — minimal, deterministic scripts to reproduce leaderboard tables and figures.
GitHub “Cite this repository” →
Citation & Versions
Cite the suite
Isler, M., & CRYLO Research. (2025). CRYLO Research: Methods & Evaluation Kit (Version v1.0.0) [Computer software]. DOI: (https://doi.org/10.5281/zenodo.17454874).
Changelog
- v1.0.0 — Initial benchmark release (methods, CSVs, figures)
- v1.1.0 — Dataset DOI + expanded slices
- v1.1.1 — Documentation fixes (no metric changes)
Limitations & Responsible Use
Known constraints
- Exchange data quality varies; filters applied, residual noise possible
- Synthetic stress scenarios simplify liquidity dynamics
- Results are sensitive to asset eligibility and costs
Disclaimer
This benchmark is for research and evaluation. It is not investment advice.






