TrioXpert is an end-to-end incident management framework for microservice systems. It combines multimodal data with LLM-based collaborative reasoning to handle anomaly detection (AD), failure triage (FT), and root cause localization (RCL) with high interpretability, and it significantly outperforms baselines across multiple benchmarks.
May 15, 2025
FlowXpert is a troubleshooting workflow orchestration framework that uses LLMs to build an incident-aware knowledge base and applies reinforcement learning with AI feedback to improve workflow generation. Evaluated on OpsFlowBench and deployed in Huawei Cloud’s datacenter, it demonstrated effectiveness in supporting engineers and AI agents.
May 1, 2025