DadsCloud System Status

Real-time monitoring of all DadsCloud services and infrastructure components.

All Systems Operational
Last updated: January 8, 2025 at 14:23 IST
Updates every 30 seconds

Core Infrastructure

Proxmox Cluster
Status: Operational
3-node high-availability virtualization cluster providing compute resources for all virtual machines.
30-day uptime: 99.98%
Avg response time: 89 ms
Requests/hour: 15,420
Error rate: 0.01%

Billing & DadsPoints
Status: Operational
Payment processing, billing management, and DadsPoints credit system.
30-day uptime: 99.96%
Avg response time: 156 ms
Transactions/day: 342
Success rate: 100%

External Services

Cloudflare Tunnels
Status: Operational
Secure global access tunnels providing DDoS protection and content delivery.
30-day uptime: 99.99%
Global latency: 28 ms
Data transferred: 2.1 TB
Blocked attacks: 18

Support Portal
Status: Operational
24/7 technical support portal and ticketing system for customer assistance.
30-day uptime: 99.92%
Avg response time: 2.3 min
Open tickets: 47
Satisfaction rate: 98.2%

Monitoring & Alerting
Status: Operational
Real-time infrastructure monitoring and automated alerting systems.
30-day uptime: 100%
Check interval: 45 ms
Active monitors: 1,247
Alerts today: 2
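
For readers who want to translate the 30-day uptime percentages above into minutes of downtime, the arithmetic can be sketched as follows. This is an illustrative helper only: the function name `downtime_minutes` is hypothetical, and it assumes uptime is measured over a full 30-day window of 24-hour days.

```python
def downtime_minutes(uptime_percent: float, days: int = 30) -> float:
    """Convert an uptime percentage into minutes of downtime over the window."""
    total_minutes = days * 24 * 60  # 43,200 minutes in a 30-day window
    return total_minutes * (100.0 - uptime_percent) / 100.0


# Uptime figures as published on this page
for name, uptime in [("Proxmox Cluster", 99.98), ("Support Portal", 99.92)]:
    print(f"{name}: {downtime_minutes(uptime):.2f} min of downtime")
```

Under these assumptions, 99.98% uptime corresponds to roughly 8.6 minutes of downtime in 30 days, and 99.92% to roughly 34.6 minutes.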

Scheduled Maintenance

Upcoming maintenance windows and planned service updates to ensure optimal performance.

Proxmox Cluster Security Updates

Jan 15, 2025 • 02:00-04:00 IST

Routine security updates will be applied to all Proxmox cluster nodes, including kernel updates, security patches, and system optimizations. Services will remain available through live migration.

Duration: 2 hours
Impact: No service interruption expected
Services: All Enterprise & Learning VMs
Type: Security Updates

Network Equipment Firmware Update

Jan 22, 2025 • 01:30-03:00 IST

Firmware updates for core network switches and routers to improve performance and add new features. Brief connectivity interruptions may occur during switch failover processes.

Duration: 1.5 hours
Impact: Brief network interruptions (<2 min)
Services: All network-dependent services
Type: Firmware Update

TrueNAS Storage Optimization

Feb 5, 2025 • 03:00-05:00 IST

Storage system optimization including disk defragmentation, ZFS pool scrubbing, and performance tuning. All data remains accessible during this maintenance window.

Duration: 2 hours
Impact: Slightly reduced I/O performance
Services: All storage-dependent services
Type: Performance Optimization

Recent Incidents

Transparency in our incident response and resolution process.

API Response Time Degradation

Resolved

Customers experienced increased API response times, which affected console performance and automated deployments. The cause was identified as database connection pool exhaustion during peak usage.

Jan 3, 14:23 IST
Incident detected: Automated monitoring systems triggered alerts for elevated API response times averaging 2.3 seconds.
Jan 3, 14:28 IST
Investigation started: Engineering team began investigating database performance and connection pooling metrics.
Jan 3, 14:45 IST
Root cause identified: Database connection pool exhaustion during peak afternoon usage. Temporary fix applied by increasing pool size.
Jan 3, 15:12 IST
Service restored: API response times returned to normal levels (under 200ms average).
Jan 3, 16:30 IST
Permanent fix deployed: Implemented dynamic connection pool scaling and optimized database queries.
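
To illustrate the idea behind dynamic connection pool scaling, here is a minimal sketch. It is not the code deployed for this incident: the class name `ScalingPool`, the `min_size` and `max_size` parameters, and the grow-and-shrink policy are all assumptions chosen for illustration.

```python
import threading


class ScalingPool:
    """Hypothetical pool that grows on demand up to max_size and shrinks when idle."""

    def __init__(self, factory, min_size=2, max_size=10):
        self._factory = factory          # callable that opens a new connection
        self._min_size = min_size
        self._max_size = max_size
        self._idle = [factory() for _ in range(min_size)]
        self._total = min_size           # connections currently open (idle + in use)
        self._lock = threading.Lock()

    @property
    def size(self):
        return self._total

    def acquire(self):
        with self._lock:
            if self._idle:
                return self._idle.pop()
            if self._total < self._max_size:
                # Scale up instead of failing while there is headroom
                self._total += 1
                return self._factory()
        raise RuntimeError("connection pool exhausted")

    def release(self, conn):
        with self._lock:
            if len(self._idle) >= self._min_size:
                # Enough idle connections already: drop the surplus one
                self._total -= 1
            else:
                self._idle.append(conn)
```

With a fixed-size pool, every request beyond the pool size fails or waits during a traffic peak. In this sketch the pool instead grows toward `max_size` under load and falls back to `min_size` as connections are released.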

Partial Console Outage

Resolved

A subset of users experienced difficulty accessing the customer console due to a load balancer configuration issue affecting one of our frontend servers.

Dec 28, 09:15 IST
Incident reported: Multiple customer reports of console login failures and timeout errors.
Dec 28, 09:22 IST
Investigation started: Identified load balancer routing issue affecting approximately 30% of console traffic.
Dec 28, 09:35 IST
Workaround implemented: Temporarily removed problematic server from load balancer pool to restore service.
Dec 28, 10:45 IST
Root cause fixed: Corrected misconfigured health check parameters and restored full load balancer functionality.
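
As background on why health check parameters matter, the sketch below shows one common way a load balancer decides whether a server is healthy: it changes state only after a run of consecutive failed or successful probes. The class name `HealthCheck` and the `fall` and `rise` thresholds are hypothetical and do not reflect the actual DadsCloud configuration.

```python
from dataclasses import dataclass


@dataclass
class HealthCheck:
    fall: int = 3         # consecutive failures before marking the server down
    rise: int = 2         # consecutive successes before marking it up again
    healthy: bool = True
    _streak: int = 0      # consecutive probe results that contradict the current state

    def record(self, ok: bool) -> bool:
        """Record one probe result and return the server's current health state."""
        if ok == self.healthy:
            self._streak = 0              # probe agrees with the current state
            return self.healthy
        self._streak += 1
        threshold = self.rise if ok else self.fall
        if self._streak >= threshold:
            self.healthy = ok             # enough evidence: flip the state
            self._streak = 0
        return self.healthy
```

If thresholds like these are set too loosely, a faulty server can stay in the pool and keep receiving traffic. If they are too strict, a healthy server can be removed after a single slow response.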

Storage Performance Impact

Resolved

Customers experienced slower disk I/O performance on some virtual machines due to a failing drive in one of our RAID arrays.

Dec 22, 11:30 IST
Performance degradation detected: Automated monitoring identified increased storage latency on RAID pool #3.
Dec 22, 11:45 IST
Hardware inspection: Physical inspection revealed early warning signs of drive failure. Initiated RAID rebuild process.
Dec 22, 12:00 IST
Proactive replacement: Hot-swapped failing drive with spare unit. RAID rebuild started automatically.
Dec 22, 16:20 IST
Performance restored: RAID rebuild completed successfully. All storage performance metrics returned to normal.