OpenDeRisk AI-Native Risk Intelligence Systems —— Your application system risk intelligent manager provides 7 * 24-hour comprehensive and in-depth protection.
- [2025/10] 🔥 We released OpenDerisk v0.2. OpenDerisk V0.2 ReleaseNote
- DeepResearch RCA: Quickly locate the root cause of issues through in-depth analysis of logs, traces, and code.
- Visualized Evidence Chain: Fully visualize the diagnostic process and evidence chain, making the diagnosis clear and enabling quick judgment of accuracy.
- Multi-Agent Collaboration: Collaboration among SRE-Agent, Code-Agent, ReportAgent, Vis-Agent, and Data-Agent.
- Open and Open-Source Architecture: OpenDeRisk is built with a completely open and open-source architecture, allowing related frameworks and code to be used out of the box in open-source projects.
The system adopts a multi-agent architecture. Currently, the code mainly implements the green-highlighted parts. Alert awareness is based on Microsoft's open-source OpenRCA dataset. The dataset size is approximately 26GB after decompression. On this dataset, we achieve root cause analysis and diagnosis through multi-agent collaboration, where the Code-Agent dynamically writes code for final analysis.
Data Layer: Pull the large-scale OpenRCA dataset (20GB) from GitHub, decompress it locally, and process it for analysis.
Logic Layer: Multi-agent architecture, with collaboration among SRE-Agent, Code-Agent, ReportAgent, Vis-Agent, and Data-Agent to perform in-depth DeepResearch RCA (Root Cause Analysis).
Visualization Layer: Use the Vis protocol to dynamically render the entire processing flow and evidence chain, as well as the process of multi-role collaboration and switching.
Digital Employees (Agents) in OpenDeRisk
Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
uv sync --all-packages --frozen \
--extra "base" \
--extra "proxy_openai" \
--extra "rag" \
--extra "storage_chromadb" \
--extra "derisks" \
--extra "storage_oss2" \
--extra "client" \
--extra "ext_base"
Configure the API_KEY in the derisk-proxy-aliyun.toml
file, then run the following command to start.
Note: By default, we use the Telecom dataset from the OpenRCA dataset. You can download it via the link or the following command:
gdown https://drive.google.com/uc?id=1cyOKpqyAP4fy-QiJ6a_cKuwR7D46zyVe
After downloading, move the datasets to the path pilot/datasets/
Run the startup command:
uv run python packages/derisk-app/src/derisk_app/derisk_server.py --config configs/derisk-proxy-aliyun.toml
Open your browser and visit http://localhost:7777
- AI-SRE(OpenRCA)
- !Notice, We Use the OpenRCA DatasetsBank Dataset,
- You can download the dataset using next link:
gdown https://drive.google.com/uc?id=1enBrdPT3wLG94ITGbSOwUFg9fkLR-16R
- Put the datasets to the path ${derisk}/pilot/datasets。
- Flame Graph Assistant
- Upload the flame graph (Java/Python) of your local application service process to the assistant for analysis and inquiries.
- DataExpert
- Upload your metrics, logs, traces, or various Excel data sheets for conversational analysis.
- Agent Development
Refer to the implementation logic under
derisk-ext.agent.agents
. - Tool Development
- Local tool
- MCP
- Other Development Documentation is under preparation...
As shown in the figure below, this demonstrates a scenario where multiple agents collaborate to handle a complex operational diagnostic task.
-
0530 V0.1 Version: Based on domain knowledge and MCP services, achieving anomaly awareness -> autonomous decision-making -> adaptive execution and issue resolution.
- Domain knowledge engine for technical risks
- Reasoning engine driven by large models for anomaly awareness -> decision-making -> execution
- Automated troubleshooting and fixes
-
0830 V0.2 Version
- MCP services and management for technical risks
- Support for custom binding of knowledge and MCP tools
- Support for 3+ DevOps domain MCP services
-
0930 V0.3 Version
- Support for integration with production environments
- Provide a complete production environment deployment solution, supporting production issue diagnosis.
-
1230 V0.4 Version
- End-to-end AIOps online Agentic RL
- End-to-end evaluation capabilities
The code (training, serving, and evaluation) in this repository is mostly developed for or derived from the paper below. Please cite it if you find the repository helpful.
@misc{di2025openderiskindustrialframeworkaidriven,
title={OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies},
author={Peng Di and Faqiang Chen and Xiao Bai and Hongjun Yang and Qingfeng Li and Ganglin Wei and Jian Mou and Feng Shi and Keting Chen and Peng Tang and Zhitao Shen and Zheng Li and Wenhui Shi and Junwei Guo and Hang Yu},
year={2025},
eprint={2510.13561},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2510.13561},
}
The OpenDeRisk-AI community is dedicated to building AI-native risk intelligence systems. 🛡️ We hope our community can provide you with better services, and we also hope that you can join us to create a better future together. 🤝
Join our networking group on Dingding and share your experience with other developers!