Some checks failed
Deploy to Phoenix / deploy (push) Has been cancelled
- ADD_CHAIN138_TO_LEDGER_LIVE: Ledger form done; public code review repo bis-innovations/LedgerLive; init/push commands - CONTRACT_DEPLOYMENT_RUNBOOK: Chain 138 gas price 1 gwei, 36-addr check, TransactionMirror workaround - CONTRACT_*: AddressMapper, MirrorManager deployed 2026-02-12; 36-address on-chain check - NEXT_STEPS_FOR_YOU: Ledger done; steps completable now (no LAN); run-completable-tasks-from-anywhere - MASTER_INDEX, OPERATOR_OPTIONAL, SMART_CONTRACTS_INVENTORY_SIMPLE: updates - LEDGER_BLOCKCHAIN_INTEGRATION_COMPLETE: bis-innovations/LedgerLive reference Co-authored-by: Cursor <cursoragent@cursor.com>
6.5 KiB
6.5 KiB
r630-02 Status Report
Last Updated: 2026-01-31
Document Version: 1.0
Status: Active Documentation
Date: 2026-01-15
Host: r630-02 (192.168.11.12)
Status: ✅ OPERATIONAL - Some Concerns
System Information
Host Details
- Hostname: r630-02
- IP Address: 192.168.11.12/24
- OS: Proxmox VE 9.1.4
- Kernel: 6.17.4-1-pve
- Uptime: 1 day, 6 hours, 40 minutes
- Architecture: x86_64
Resource Usage
- Memory: 251GB total, 8.6GB used, 235GB free, 243GB available
- CPU: Intel Xeon E5-2660 v4 @ 2.00GHz (2 sockets, 28 cores, 56 threads)
- CPU Usage: 4.9% user, 5.5% system, 89.2% idle
- Load Average: ⚠️ 12.03, 11.65, 11.51 (High - concerning)
- Swap: 0B (no swap configured)
Proxmox Status
Services
- pve-cluster: ✅ Active (running) - 1 day 6h uptime
- pvedaemon: ✅ Active (running) - 1 day 6h uptime
- pveproxy: ✅ Active (running)
Version
- Proxmox VE: 9.1.4/5ac30304265fbd8e
- Kernel: 6.17.4-1-pve
VMs and Containers
- Containers: 4 total
- ✅ 5000 - blockscout-1 (running)
- ✅ 6200 - firefly-1 (running)
- ⏸️ 6201 - firefly-ali-1 (stopped)
- ✅ 7811 - mim-api-1 (running)
- VMs: 0
Network Configuration
Interfaces
- Primary: vmbr0 (bridge)
- IP: 192.168.11.12/24
- Gateway: 192.168.11.1
- Bridge Port: nic2
- Physical: nic2 (UP, connected to vmbr0)
- Other NICs: nic0, nic1, nic3 (DOWN)
VLAN Configuration
- Bridge: vmbr0 (VLAN-aware)
- Native VLAN: 1 (untagged)
- Container VLANs: All on VLAN 1 (untagged)
- Note: ⚠️ Containers are not using VLAN tags - all on native VLAN
Routing
- Default Gateway: 192.168.11.1 (VLAN 11)
- Local Network: 192.168.11.0/24
Cluster Status
Cluster Membership
- Cluster Name: h
- Status: ✅ Quorate (3/3 nodes)
- Nodes:
- Node 1: ml110 (192.168.11.10)
- Node 2: r630-01 (192.168.11.11)
- Node 3: r630-02 (192.168.11.12) - Local
Quorum
- Expected Votes: 3
- Total Votes: 3
- Quorum: 2
- Status: ✅ Quorate
Storage Status
Storage Pools
| Name | Type | Status | Total | Used | Available | Usage % |
|---|---|---|---|---|---|---|
| local | dir | ✅ Active | 220GB | 7.3GB | 212GB | 3.31% |
| thin1-r630-02 | lvmthin | ✅ Active | 226GB | 200GB | 26GB | ⚠️ 88.51% |
| thin2 | lvmthin | ✅ Active | 226GB | 200GB | 26GB | ⚠️ 88.33% |
| thin3 | lvmthin | ✅ Active | 226GB | 0GB | 226GB | 0.00% |
| thin4 | lvmthin | ✅ Active | 226GB | 29GB | 197GB | 12.69% |
| thin5 | lvmthin | ✅ Active | 226GB | 0GB | 226GB | 0.00% |
| thin6 | lvmthin | ✅ Active | 226GB | 0GB | 226GB | 0.00% |
Concerns:
- ⚠️ thin1-r630-02: 88.51% full (26GB free)
- ⚠️ thin2: 88.33% full (26GB free)
- ⚠️ Consider cleanup or expansion
System Logs
Recent Activity
- ✅ Cluster filesystem verification: Successful (hourly)
- ✅ Container operations: Normal (start, stop, backup)
- ✅ System updates: Regular apt updates
Errors/Warnings
⚠️ CRITICAL: Memory OOM (Out of Memory) Kills
Multiple containers have been killed due to memory exhaustion:
| Date | Process | Container | Issue |
|---|---|---|---|
| Jan 14 01:39 | npm exec func s | UID:100000 | OOM killed (708MB) |
| Jan 14 07:42 | systemd-journal | UID:100000 | OOM killed (39MB) |
| Jan 14 07:42 | npm exec func s | UID:100000 | OOM killed (633MB) |
| Jan 14 09:37 | apt-get | UID:100000 | OOM killed (88MB) |
| Jan 14 11:10 | node | UID:100000 | OOM killed (331MB) |
| Jan 14 13:01 | python3 | UID:100000 | OOM killed (38MB) |
| Jan 14 16:06 | npm exec func s | UID:100000 | OOM killed (633MB) |
| Jan 14 16:40 | systemd-journal | UID:100000 | OOM killed (31MB) |
| Jan 14 16:48 | networkd-dispat | UID:100000 | OOM killed (29MB) |
Analysis:
- Multiple containers experiencing OOM kills
- Pattern suggests memory limits may be too restrictive
- Some containers (npm/node processes) consuming significant memory
- Systemd-journal and networkd-dispat also being killed
Recommendation:
- Review container memory limits
- Increase memory allocation for affected containers
- Monitor memory usage patterns
- Consider adding swap (currently 0B)
Other Issues:
- ⚠️ Subscription check failed: DNS resolution issue (non-critical)
Firewall Status
Proxmox Firewall
- Status: ✅ Enabled/Running
- Configuration: Active
Firewall Rules
- ✅ Rule: Allow Default Network (192.168.0.0/24)
- Direction: IN
- Action: ACCEPT
- Source: 192.168.0.0/24
- Logging: Disabled
Note: Firewall is properly configured to allow access from Default network.
Summary
✅ Operational Status
System: ✅ OPERATIONAL
- Proxmox services running
- Cluster quorate (3/3 nodes)
- Network configured correctly
- Firewall enabled and configured
⚠️ Concerns
-
High Load Average:
- Load: 12.03, 11.65, 11.51
- With 56 CPU threads, this is moderate but worth monitoring
- May indicate CPU-intensive workloads
-
Memory OOM Kills:
- Multiple containers killed due to memory exhaustion
- Pattern suggests memory limits may be too restrictive
- Need to review and adjust container memory allocations
-
Storage Usage:
- thin1-r630-02: 88.51% full (26GB free)
- thin2: 88.33% full (26GB free)
- Consider cleanup or expansion
-
Container VLAN Assignment:
- All containers on native VLAN (untagged)
- Not utilizing VLAN plan yet
- Ready for VLAN migration
📋 Recommendations
-
Immediate:
- Review container memory limits
- Increase memory for containers experiencing OOM kills
- Monitor load average trends
-
Short-term:
- Clean up or expand thin1-r630-02 and thin2 storage pools
- Migrate containers to appropriate VLANs
- Add swap space (currently 0B)
-
Long-term:
- Optimize container resource allocation
- Plan storage expansion
- Complete VLAN migration
🎯 Overall Assessment
Status: ✅ OPERATIONAL WITH CONCERNS
The system is functional and stable, but requires attention to:
- Memory management (OOM kills)
- Storage capacity (thin pools nearly full)
- Load monitoring (high but manageable)
Ready for:
- ✅ Normal operations
- ✅ Container management
- ✅ VLAN migration (when ready)
- ⚠️ Memory optimization needed
Last Updated: 2026-01-15
Last Reviewed: 2026-01-15
Actions Taken: Memory limits adjusted, status re-checked