❤️ System Health Monitoring

Real-time monitoring, diagnostics, and health checks to ensure your AI system runs at peak performance

Always-On System Health

Monitor every aspect of your Hive AI system in real time. Track API health, model availability, and performance metrics, and get alerts as soon as issues arise.

System Status

All Systems Operational

• API Uptime: 99.9%
• Models Online: 320+
• Avg Response: 45 ms
• Active Incidents: 0

🔌 API Health

Provider connections, response times, error rates

🤖 Model Status

Availability, performance, rate limits

💾 System Resources

Database, storage, memory usage

🧪 CLI Usage

# Quick status check
hive health

✅ API Connection: Healthy
✅ License: Valid (Enterprise)
✅ Models: 320/320 available
✅ Database: Connected
✅ Overall Status: Operational
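
For scripting, the status lines above can be turned into structured data. The following is a minimal sketch that assumes each line keeps the "icon, label, colon, value" shape of the sample output; the function name `parse_health_output` is hypothetical and is not part of the Hive CLI.

```python
import re

# Assumed line shape, taken from the sample output: "<icon> <label>: <value>"
STATUS_LINE = re.compile(r"^(?P<icon>\S+)\s+(?P<label>[^:]+):\s*(?P<value>.+)$")


def parse_health_output(text: str) -> dict[str, dict[str, object]]:
    """Map each status label to its value and whether it carries the ✅ icon."""
    report: dict[str, dict[str, object]] = {}
    for line in text.splitlines():
        match = STATUS_LINE.match(line.strip())
        if match is None:
            continue  # skip blank lines and anything that is not a status line
        report[match["label"].strip()] = {
            "ok": match["icon"] == "✅",
            "value": match["value"].strip(),
        }
    return report


sample = """\
✅ API Connection: Healthy
✅ License: Valid (Enterprise)
✅ Models: 320/320 available
✅ Database: Connected
✅ Overall Status: Operational
"""

report = parse_health_output(sample)
failing = [label for label, entry in report.items() if not entry["ok"]]
print(failing or "all checks passed")
```

If the real output format differs (for example, if a machine-readable mode exists), prefer that over pattern matching on display text.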

Detailed Diagnostics

# Comprehensive health report
hive health --detailed

# Check specific component
hive health --component database

🔧 IDE Integration (Claude Code, Cursor, Windsurf)

Basic Health Check

// MCP Tool Call
hive_health({
  component: "all",
  detailed: false
})

Quick system status check

Detailed Report

// MCP Tool Call
hive_health({
  component: "all",
  detailed: true
})

Comprehensive diagnostics

Component Check

// MCP Tool Call
hive_health({
  component: "providers",
  detailed: true
})

Check specific components

Performance Stats

// MCP Tool Call
hive_health({
  component: "performance",
  detailed: true
})

Performance metrics
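
The IDE normally issues these tool calls for you. For reference, the sketch below shows roughly what such a call looks like on the wire, assuming the standard MCP `tools/call` JSON-RPC request shape. The tool name and the `component` and `detailed` arguments come from the examples above; the helper `build_health_call` and the request ID counter are hypothetical.

```python
import json
from itertools import count

# Hypothetical request ID source; a real MCP client manages its own IDs.
_request_ids = count(1)


def build_health_call(component: str = "all", detailed: bool = False) -> str:
    """Serialize a hive_health tool call as an MCP `tools/call` request."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {
            "name": "hive_health",
            "arguments": {"component": component, "detailed": detailed},
        },
    }
    return json.dumps(request)


# The four calls shown above, expressed as request payloads.
for component, detailed in [
    ("all", False),
    ("all", True),
    ("providers", True),
    ("performance", True),
]:
    print(build_health_call(component, detailed))
```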

📊 Health Metrics Explained

🔌 API Health Metrics

• Connection Status: Active/Failed/Retry
• Response Time: Average latency in ms
• Error Rate: Percentage of failed requests
• Rate Limits: Usage vs. available quota

Healthy Range:
• Response Time: <100 ms
• Error Rate: <1%
• Uptime: >99.5%
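
As an illustration, the healthy ranges above can be encoded as a simple check. This sketch assumes you already have the raw numbers from your own monitoring; the `ApiHealthSample` type and its field names are hypothetical, and only the three thresholds come from this page.

```python
from dataclasses import dataclass


@dataclass
class ApiHealthSample:
    """Hypothetical container for one measurement window."""
    avg_response_ms: float
    failed_requests: int
    total_requests: int
    uptime_pct: float


def error_rate_pct(sample: ApiHealthSample) -> float:
    """Failed requests as a percentage of all requests (0.0 when idle)."""
    if sample.total_requests == 0:
        return 0.0
    return 100.0 * sample.failed_requests / sample.total_requests


def out_of_range(sample: ApiHealthSample) -> list[str]:
    """Return one message per metric outside the documented healthy range."""
    problems = []
    if sample.avg_response_ms >= 100:  # healthy: <100 ms
        problems.append(f"response time {sample.avg_response_ms:.0f} ms")
    if error_rate_pct(sample) >= 1.0:  # healthy: <1%
        problems.append(f"error rate {error_rate_pct(sample):.2f}%")
    if sample.uptime_pct <= 99.5:  # healthy: >99.5%
        problems.append(f"uptime {sample.uptime_pct:.2f}%")
    return problems


healthy = ApiHealthSample(avg_response_ms=45, failed_requests=2,
                          total_requests=1000, uptime_pct=99.9)
degraded = ApiHealthSample(avg_response_ms=250, failed_requests=30,
                           total_requests=1000, uptime_pct=99.0)
print(out_of_range(healthy))
print(out_of_range(degraded))
```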

🤖 Model Health Metrics

• Availability: Online/Offline status
• Performance: Token generation speed
• Queue Time: Wait time for requests
• Success Rate: Completed vs. failed requests

Monitoring Tips:
• Watch for queue spikes
• Compare model response rates
• Track model availability
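
One way to watch for queue spikes is to compare each new queue-time reading against a rolling baseline. The sketch below is only an illustration: the window size, the 3x multiplier, and the function name `find_queue_spikes` are all assumptions, not values Hive prescribes.

```python
from collections import deque
from statistics import median


def find_queue_spikes(queue_times_ms, window=5, factor=3.0):
    """Return indexes of readings that exceed `factor` x the rolling median.

    The baseline is the median of the previous `window` readings, so nothing
    is flagged until that many readings have been seen.
    """
    recent = deque(maxlen=window)
    spikes = []
    for index, value in enumerate(queue_times_ms):
        if len(recent) == window and value > factor * median(recent):
            spikes.append(index)
        recent.append(value)
    return spikes


readings = [20, 22, 19, 21, 20, 23, 150, 21, 20]
print(find_queue_spikes(readings))
```

A median baseline is used here rather than a mean so that a single spike does not inflate the baseline for the readings that follow it.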

💾 System Resource Metrics

• Database Health: Connections, latency, error rate
• Storage: Capacity, IOPS, throughput
• Memory: Usage, swap activity
• CPU: Load average, idle time

Resource Alerts:
• Memory usage above 85%
• Disk space below 10%
• DB latency spikes
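
The two numeric alert thresholds above can be sketched as a small check. This example assumes memory usage is supplied by your own tooling, and it reads free disk space with Python's standard `shutil.disk_usage`. The function names are hypothetical, and DB latency is left out because this page gives no numeric threshold for it.

```python
import shutil


def disk_free_pct(path: str = ".") -> float:
    """Percentage of free space on the filesystem that contains `path`."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.free / usage.total


def resource_alerts(memory_used_pct: float, disk_free_percent: float) -> list[str]:
    """Apply the documented thresholds: memory above 85%, disk space below 10%."""
    alerts = []
    if memory_used_pct > 85.0:
        alerts.append(f"memory usage at {memory_used_pct:.1f}%")
    if disk_free_percent < 10.0:
        alerts.append(f"disk space at {disk_free_percent:.1f}% free")
    return alerts


# The memory figure here is a made-up input; disk space is measured locally.
print(resource_alerts(memory_used_pct=62.0, disk_free_percent=disk_free_pct()))
```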

❤️ Healthy Systems, Confident Teams

Keep your AI systems running smoothly with real-time monitoring, actionable diagnostics, and proactive alerts.