DadsCloud System Status

Real-time monitoring of all DadsCloud services and infrastructure components.

All Systems Operational

Last updated: January 8, 2025 at 14:23 IST
Updates every 30 seconds

Core Infrastructure

Proxmox Cluster

Operational

Three-node high-availability virtualization cluster providing compute resources for all virtual machines.

30-day uptime: 99.98%

Avg response time: 89 ms

Requests/hour: 15,420

Error rate: 0.01%
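To put the uptime percentages on this page in concrete terms, the short sketch below converts a 30-day uptime figure into minutes of downtime. The function name is illustrative and the calculation assumes the window is exactly 30 days of 24 hours; how DadsCloud actually measures uptime is not stated here.

```python
def downtime_minutes(uptime_percent: float, window_days: int = 30) -> float:
    """Return the minutes of downtime implied by an uptime percentage.

    Assumes the window is window_days full 24-hour days.
    """
    total_minutes = window_days * 24 * 60  # 43,200 minutes in 30 days
    return total_minutes * (100.0 - uptime_percent) / 100.0


# Uptime figures as listed on this page.
services = {
    "Proxmox Cluster": 99.98,
    "Billing & DadsPoints": 99.96,
    "Cloudflare Tunnels": 99.99,
    "Support Portal": 99.92,
}

for name, pct in services.items():
    print(f"{name}: {downtime_minutes(pct):.2f} min of downtime in 30 days")
```

Under these assumptions, 99.98% over 30 days corresponds to roughly 8.6 minutes of downtime.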

Billing & DadsPoints

Operational

Payment processing, billing management, and DadsPoints credit system.

30-day uptime: 99.96%

Avg response time: 156 ms

Transactions/day: 342

Success rate: 100%

External Services

Cloudflare Tunnels

Operational

Secure global access tunnels providing DDoS protection and content delivery.

30-day uptime: 99.99%

Global latency: 28 ms

Data transferred: 2.1 TB

Blocked attacks

Support Portal

Operational

24/7 technical support portal and ticketing system for customer assistance.

30-day uptime: 99.92%

Avg response time: 2.3 min

Open tickets

Satisfaction rate: 98.2%

Monitoring & Alerting

Operational

Real-time infrastructure monitoring and automated alerting systems.

30-day uptime: 100%

Check interval: 45 ms

Active monitors: 1,247

Alerts today

Scheduled Maintenance

Upcoming maintenance windows and planned service updates to ensure optimal performance.

Proxmox Cluster Security Updates

Jan 15, 2025 • 02:00-04:00 IST

Routine security updates and patches will be applied to all Proxmox cluster nodes. This maintenance includes kernel updates, security patches, and system optimizations. Services will remain available through live migration capabilities.

Duration: 2 hours

Impact: No service interruption expected

Services: All Enterprise & Learning VMs

Type: Security Updates
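The idea of patching a cluster while keeping services available through live migration can be sketched as a rolling update: empty one node, patch it, then move on to the next. The simulation below is only an illustration. The node and VM names are hypothetical, it does not call any Proxmox API, and the placement rule (move each VM to the least-loaded other node) is an assumption rather than a description of how DadsCloud schedules migrations.

```python
def rolling_update(cluster: dict[str, list[str]]) -> list[str]:
    """Simulate patching every node while keeping all VMs running.

    cluster maps a node name to the VMs it currently hosts and is
    modified in place. Requires at least two nodes, so that there is
    always somewhere to migrate VMs to.
    """
    log = []
    for node in list(cluster):
        others = [n for n in cluster if n != node]
        # Drain the node: move each VM to the least-loaded other node.
        for vm in list(cluster[node]):
            target = min(others, key=lambda n: len(cluster[n]))
            cluster[node].remove(vm)
            cluster[target].append(vm)
            log.append(f"migrate {vm}: {node} -> {target}")
        # The node is now empty and can be patched and rebooted.
        log.append(f"patch {node}")
    return log


# Hypothetical three-node cluster.
cluster = {"node-1": ["vm-a", "vm-b"], "node-2": ["vm-c"], "node-3": []}
for step in rolling_update(cluster):
    print(step)
```

At no point in the simulation is a VM stopped. Each one is always hosted on some node, which is the property that makes "no service interruption expected" plausible.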

Network Equipment Firmware Update

Jan 22, 2025 • 01:30-03:00 IST

Firmware updates for core network switches and routers to improve performance and add new features. Brief connectivity interruptions may occur during switch failover processes.

Duration: 1.5 hours

Impact: Brief network interruptions (<2 min)

Services: All network-dependent services

Type: Firmware Update

TrueNAS Storage Optimization

Feb 5, 2025 • 03:00-05:00 IST

Storage system optimization including disk defragmentation, ZFS pool scrubbing, and performance tuning. All data remains accessible during this maintenance window.

Duration: 2 hours

Impact: Slightly reduced I/O performance

Services: All storage-dependent services

Type: Performance Optimization

Recent Incidents

Transparency in our incident response and resolution process.

API Response Time Degradation

Resolved

Customers experienced increased API response times, which affected console performance and automated deployments. The cause was database connection pool exhaustion during peak usage.

Jan 3, 14:23 IST

Incident detected: Automated monitoring systems triggered alerts for elevated API response times averaging 2.3 seconds.

Jan 3, 14:28 IST

Investigation started: Engineering team began investigating database performance and connection pooling metrics.

Jan 3, 14:45 IST

Root cause identified: Database connection pool exhaustion during peak afternoon usage. Temporary fix applied by increasing pool size.

Jan 3, 15:12 IST

Service restored: API response times returned to normal levels (under 200ms average).

Jan 3, 16:30 IST

Permanent fix deployed: Implemented dynamic connection pool scaling and optimized database queries.
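The timeline above does not describe how the dynamic connection pool scaling was implemented. As a rough illustration of the idea, the sketch below shows a toy pool that starts small, creates connections on demand up to a ceiling, and discards surplus connections when they are returned. The class name, parameters, and sizing policy are all assumptions made for this example.

```python
import threading


class DynamicPool:
    """Toy pool that grows on demand up to max_size and shrinks when idle."""

    def __init__(self, factory, min_size=2, max_size=10):
        self._factory = factory          # callable that creates a connection
        self._min_size = min_size        # idle connections kept ready
        self._max_size = max_size        # hard ceiling on total connections
        self._idle = [factory() for _ in range(min_size)]
        self._in_use = 0
        self._cond = threading.Condition()

    @property
    def size(self):
        with self._cond:
            return len(self._idle) + self._in_use

    def acquire(self, timeout=None):
        with self._cond:
            while True:
                if self._idle:
                    conn = self._idle.pop()
                    break
                if self._in_use < self._max_size:
                    conn = self._factory()   # grow instead of blocking
                    break
                # At the ceiling: wait for a release or give up.
                if not self._cond.wait(timeout):
                    raise TimeoutError("connection pool exhausted")
            self._in_use += 1
            return conn

    def release(self, conn):
        with self._cond:
            self._in_use -= 1
            if len(self._idle) < self._min_size:
                self._idle.append(conn)      # keep a small warm reserve
            # Otherwise drop the surplus; a real pool would close it here.
            self._cond.notify()
```

With a fixed-size pool, a traffic peak above the configured size makes every extra request wait. Letting the pool grow toward a ceiling absorbs the peak, while the ceiling still protects the database from an unbounded number of connections.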

Partial Console Outage

Resolved

A subset of users experienced difficulty accessing the customer console due to a load balancer configuration issue affecting one of our frontend servers.

Dec 28, 09:15 IST

Incident reported: Multiple customer reports of console login failures and timeout errors.

Dec 28, 09:22 IST

Investigation started: Identified load balancer routing issue affecting approximately 30% of console traffic.

Dec 28, 09:35 IST

Workaround implemented: Temporarily removed problematic server from load balancer pool to restore service.

Dec 28, 10:45 IST

Root cause fixed: Corrected misconfigured health check parameters and restored full load balancer functionality.
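The specific health check parameters involved in this incident are not given. The sketch below illustrates, under assumed names and thresholds, how such parameters typically decide whether a server receives traffic: a backend leaves the pool after a number of consecutive failed checks and returns after a number of consecutive successful ones.

```python
from dataclasses import dataclass


@dataclass
class Backend:
    name: str
    healthy: bool = True
    fail_streak: int = 0
    ok_streak: int = 0


class HealthChecker:
    """Tracks check results and decides which backends receive traffic."""

    def __init__(self, names, unhealthy_after=3, healthy_after=2):
        self.backends = {n: Backend(n) for n in names}
        self.unhealthy_after = unhealthy_after  # failures before removal
        self.healthy_after = healthy_after      # successes before return

    def record(self, name, ok):
        """Record the result of one health check for a backend."""
        b = self.backends[name]
        if ok:
            b.ok_streak += 1
            b.fail_streak = 0
            if not b.healthy and b.ok_streak >= self.healthy_after:
                b.healthy = True
        else:
            b.fail_streak += 1
            b.ok_streak = 0
            if b.healthy and b.fail_streak >= self.unhealthy_after:
                b.healthy = False

    def pool(self):
        """Names of backends currently eligible for traffic."""
        return [b.name for b in self.backends.values() if b.healthy]


# Hypothetical frontend servers.
checker = HealthChecker(["web-1", "web-2", "web-3"])
for _ in range(3):
    checker.record("web-2", ok=False)
print(checker.pool())  # web-2 has been removed from rotation
```

If the failure threshold is set too high or the check targets the wrong endpoint, a broken server keeps receiving its share of traffic, which is consistent with a fraction of console requests failing until the server was removed by hand.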

Storage Performance Impact

Resolved

Customers experienced slower disk I/O performance on some virtual machines due to a failing drive in one of our RAID arrays.

Dec 22, 11:30 IST

Performance degradation detected: Automated monitoring identified increased storage latency on RAID pool #3.

Dec 22, 11:45 IST

Hardware inspection: Physical inspection revealed early warning signs of drive failure. Initiated RAID rebuild process.

Dec 22, 12:00 IST

Proactive replacement: Hot-swapped failing drive with spare unit. RAID rebuild started automatically.

Dec 22, 16:20 IST

Performance restored: RAID rebuild completed successfully. All storage performance metrics returned to normal.
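Automated detection of increased storage latency, as in the first step of this timeline, is often done by comparing a rolling average against a threshold so that a single slow request does not raise an alert. The sketch below shows that approach. The threshold, window size, and sample values are invented for illustration and are not DadsCloud's actual monitoring settings.

```python
from collections import deque


class LatencyAlert:
    """Fires when the rolling average latency exceeds a threshold."""

    def __init__(self, threshold_ms: float, window: int = 5):
        self.threshold_ms = threshold_ms
        self.samples = deque(maxlen=window)  # keeps only the newest samples

    def observe(self, latency_ms: float) -> bool:
        """Add a sample; return True if the alert condition is met."""
        self.samples.append(latency_ms)
        if len(self.samples) < self.samples.maxlen:
            return False  # not enough data for a full window yet
        average = sum(self.samples) / len(self.samples)
        return average > self.threshold_ms


# Made-up latency samples in milliseconds for one storage pool.
alert = LatencyAlert(threshold_ms=20.0, window=3)
for sample in [5, 6, 5, 40, 60, 70]:
    if alert.observe(sample):
        print(f"ALERT: rolling average above 20 ms (latest sample {sample} ms)")
```

The window size trades detection speed against noise: a larger window suppresses brief spikes but takes longer to react to a genuinely degrading drive.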
