Files
proxmox/reports/r630-02-startup-failures-resolution.md
defiQUG fbda1b4beb
Some checks failed
Deploy to Phoenix / deploy (push) Has been cancelled
docs: Ledger Live integration, contract deploy learnings, NEXT_STEPS updates
- ADD_CHAIN138_TO_LEDGER_LIVE: Ledger form done; public code review repo bis-innovations/LedgerLive; init/push commands
- CONTRACT_DEPLOYMENT_RUNBOOK: Chain 138 gas price 1 gwei, 36-addr check, TransactionMirror workaround
- CONTRACT_*: AddressMapper, MirrorManager deployed 2026-02-12; 36-address on-chain check
- NEXT_STEPS_FOR_YOU: Ledger done; steps completable now (no LAN); run-completable-tasks-from-anywhere
- MASTER_INDEX, OPERATOR_OPTIONAL, SMART_CONTRACTS_INVENTORY_SIMPLE: updates
- LEDGER_BLOCKCHAIN_INTEGRATION_COMPLETE: bis-innovations/LedgerLive reference

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-12 15:46:57 -08:00

3.8 KiB

R630-02 Container Startup Failures - Resolution

Date: January 19, 2026
Status: ROOT CAUSE IDENTIFIED - CONTAINERS ON WRONG NODE


Critical Finding

All 33 containers that failed to start on r630-02 do not exist on that node. They have been migrated to pve2 (192.168.11.11) and are currently stopped there.


Root Cause

The startup script attempted to start containers on r630-02, but:

  1. Container configuration files are missing on r630-02 (/etc/pve/lxc/XXXX.conf don't exist)
  2. Logical volumes are missing on r630-02 (no vm-XXXX-disk-X volumes)
  3. All containers exist on pve2 and are in "stopped" state

Conclusion: The containers were migrated from r630-02 to pve2, but the startup operation was attempted on the wrong node.


Container Locations

On pve2 (192.168.11.11) - All 33 containers found:

Logical Volume Error Containers (8):

  • CT 3000: ml110 - stopped
  • CT 3001: ml110 - stopped
  • CT 3002: ml110 - stopped
  • CT 3003: ml110 - stopped
  • CT 3500: oracle-publisher-1 - stopped
  • CT 3501: ccip-monitor-1 - stopped
  • CT 6000: fabric-1 - stopped
  • CT 6400: indy-1 - stopped

Startup Failure Containers (24):

  • CT 5200: cacti-1 - stopped
  • CT 10000-10092: Order management services (12 containers) - stopped
  • CT 10100-10151: DBIS Core services (6 containers) - stopped
  • CT 10200-10230: Order monitoring services (5 containers) - stopped

Lock Error Container (1):

  • CT 10232: CT10232 - stopped, locked in "create" state

Resolution

Since all containers exist on pve2, start them there:

./scripts/start-containers-on-pve2.sh

Option 2: Migrate Containers Back to r630-02

If containers should be on r630-02, migrate them back:

# For each container
pct migrate <VMID> r630-02 --storage thin1-r630-02 --restart

Note: This requires:

  • Available storage on r630-02
  • Network connectivity between nodes
  • Container data to be migrated

Next Steps

  1. Diagnostic Complete - Identified containers are on pve2
  2. Start Containers on pve2 - Use the start script
  3. Verify Services - Check that services start correctly
  4. Update Documentation - Document actual container locations

Container Inventory on pve2

All 33 containers are present on pve2:

VMID Hostname Status
3000 ml110 stopped
3001 ml110 stopped
3002 ml110 stopped
3003 ml110 stopped
3500 oracle-publisher-1 stopped
3501 ccip-monitor-1 stopped
5200 cacti-1 stopped
6000 fabric-1 stopped
6400 indy-1 stopped
10000 order-postgres-primary stopped
10001 order-postgres-replica stopped
10020 order-redis stopped
10030 order-identity stopped
10040 order-intake stopped
10050 order-finance stopped
10060 order-dataroom stopped
10070 order-legal stopped
10080 order-eresidency stopped
10090 order-portal-public stopped
10091 order-portal-internal stopped
10092 order-mcp-legal stopped
10100 dbis-postgres-primary stopped
10101 dbis-postgres-replica-1 stopped
10120 dbis-redis stopped
10130 dbis-frontend stopped
10150 dbis-api-primary stopped
10151 dbis-api-secondary stopped
10200 order-prometheus stopped
10201 order-grafana stopped
10202 order-opensearch stopped
10210 order-haproxy stopped
10230 order-vault stopped
10232 CT10232 stopped (locked)

Action Required

Immediate: Start containers on pve2 where they actually exist, not on r630-02.

Future: Update startup scripts to check container location before attempting to start, or migrate containers to intended nodes.