
Documentation
❤️System Health Monitoring
Real-time monitoring, diagnostics, and health checks to ensure your AI system runs at peak performance
← Back to DocumentationAlways-On System Health
Monitor every aspect of your Hive AI system in real-time. Track API health, model availability, performance metrics, and get instant alerts when issues arise. Keep your AI infrastructure running smoothly.
System Status
All Systems Operational
99.9%
API Uptime
320+
Models Online
45ms
Avg Response
0
Active Incidents
🔌 API Health
Provider connections, response times, error rates
🤖 Model Status
Availability, performance, rate limits
💾 System Resources
Database, storage, memory usage
🧪 CLI Usage
✅ API Connection: Healthy
✅ License: Valid (Enterprise)
✅ Models: 320/320 available
✅ Database: Connected
✅ Overall Status: Operational
Detailed Diagnostics
# Comprehensive health report
hive health --detailed
# Check specific component
hive health --component database
🔧 IDE Integration (Claude Code, Cursor, Windsurf)
Basic Health Check
// MCP Tool Call
hive_health({
component: "all",
detailed: false
})
Quick system status check
Detailed Report
// MCP Tool Call
hive_health({
component: "all",
detailed: true
})
Comprehensive diagnostics
Component Check
// MCP Tool Call
hive_health({
component: "providers",
detailed: true
})
Check specific components
Performance Stats
// MCP Tool Call
hive_health({
component: "performance",
detailed: true
})
Performance metrics
📊 Health Metrics Explained
🔌 API Health Metrics
- Connection Status: Active/Failed/Retry
- Response Time: Average latency in ms
- Error Rate: Failed requests percentage
- Rate Limits: Usage vs available quota
Healthy Range:
• Response: <100ms
• Error Rate: <1%
• Uptime: >99.5%
🤖 Model Health Metrics
- Availability: Online/Offline status
- Performance: Token generation speed
- Queue Time: Wait time for requests
- Success Rate: Completed vs failed
Monitoring Tips:
• Watch for queue spikes
• Compare model response rates
• Track model availability
💾 System Resource Metrics
- Database Health: Connections, latency, error rate
- Storage: Capacity, IOPS, throughput
- Memory: Usage, swap activity
- CPU: Load average, idle time
Resource Alerts:
• Memory usage above 85%
• Disk space below 10%
• DB latency spikes
❤️ Healthy Systems, Confident Teams
Keep your AI systems running smoothly with real-time monitoring, actionable diagnostics, and proactive alerts.