
r630-02 Memory Limit Fix - Complete

Date: 2026-01-19
Status: COMPLETE


Executive Summary

All immediate actions from the log review have been resolved. Memory limits for all containers on r630-02 have been increased to appropriate levels to prevent OOM (Out of Memory) kills.


Actions Taken

1. Memory Limits Updated

All 7 containers have had their memory limits increased significantly:

| VMID  | Name              | Old Limit | New Limit | New Swap | Status  |
|-------|-------------------|-----------|-----------|----------|---------|
| 5000  | blockscout-1      | 8MB       | 2GB       | 1GB      | Updated |
| 6200  | firefly-1         | 4MB       | 512MB     | 256MB    | Updated |
| 6201  | firefly-ali-1     | 2MB       | 512MB     | 256MB    | Updated |
| 7810  | mim-web-1         | 4MB       | 256MB     | 128MB    | Updated |
| 7811  | mim-api-1         | 4MB       | 1GB       | 512MB    | Updated |
| 8641  | vault-phoenix-2   | 4MB       | 512MB     | 256MB    | Updated |
| 10234 | npmplus-secondary | 1MB       | 24GB      | 4GB      | Updated |

2. Containers Restarted

All containers have been restarted to apply the new memory limits immediately.
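
On Proxmox, per-container limits are changed with `pct set` and take effect after a restart. A minimal sketch of how the limits above could be applied; the `apply` helper and the `DRY_RUN` flag are illustrative, not the actual fix script:

```shell
#!/bin/sh
# Sketch: apply the new memory/swap limits with pct, then reboot each CT.
# DRY_RUN=1 (default) only prints the commands; set DRY_RUN=0 on the PVE host.
set -eu
DRY_RUN="${DRY_RUN:-1}"

apply() {
  vmid="$1" mem_mb="$2" swap_mb="$3"
  if [ "$DRY_RUN" = 1 ]; then
    echo "pct set $vmid --memory $mem_mb --swap $swap_mb"
  else
    pct set "$vmid" --memory "$mem_mb" --swap "$swap_mb"
    pct reboot "$vmid"   # restart so the new cgroup limits take effect
  fi
}

apply 5000 2048 1024    # blockscout-1
apply 6200 512 256      # firefly-1
apply 6201 512 256      # firefly-ali-1
apply 7810 256 128      # mim-web-1
apply 7811 1024 512     # mim-api-1
apply 8641 512 256      # vault-phoenix-2
apply 10234 24576 4096  # npmplus-secondary
```

Running it with `DRY_RUN=1` first makes the exact `pct set` invocations reviewable before anything is changed.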


Problem Analysis

Root Cause

The containers' memory limits were set far below their actual usage:

  • Container 5000 (blockscout-1): 8MB limit but using 736MB → 92x over limit
  • Container 6200 (firefly-1): 4MB limit but using 182MB → 45x over limit
  • Container 6201 (firefly-ali-1): 2MB limit but using 190MB → 95x over limit
  • Container 7810 (mim-web-1): 4MB limit but using 40MB → 10x over limit
  • Container 7811 (mim-api-1): 4MB limit but using 90MB → 22x over limit (most affected)
  • Container 8641 (vault-phoenix-2): 4MB limit but using 68MB → 17x over limit
  • Container 10234 (npmplus-secondary): 1MB limit but using 20,283MB → 20,283x over limit

This explains why containers were experiencing frequent OOM kills, especially container 7811 (mim-api-1).
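
The over-limit factors above are just usage divided by limit. A quick sketch to reproduce them (values in MB, taken from the bullets above; integer division matches the report's rounding):

```shell
# Recompute the "Nx over limit" factors (integer division, values in MB).
ratio() {
  limit_mb="$1" usage_mb="$2" name="$3"
  echo "$name: ${usage_mb}MB / ${limit_mb}MB = $(( usage_mb / limit_mb ))x"
}

ratio 8 736 blockscout-1         # 92x
ratio 4 182 firefly-1            # 45x
ratio 2 190 firefly-ali-1        # 95x
ratio 4 40 mim-web-1             # 10x
ratio 4 90 mim-api-1             # 22x
ratio 4 68 vault-phoenix-2       # 17x
ratio 1 20283 npmplus-secondary  # 20283x
```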

Impact

  • Before: Containers were constantly hitting memory limits, causing:

    • Process kills (systemd-journal, node, npm, apt-get, etc.)
    • Service interruptions
    • Application instability
    • Poor performance
  • After: Containers now have adequate memory limits with:

    • Headroom for normal operation
    • Swap space for temporary spikes
    • Reduced risk of OOM kills
    • Improved stability

New Memory Configuration

Memory Limits (Based on Usage + Buffer)

| Container         | Current Usage | New Limit | Buffer | Rationale                          |
|-------------------|---------------|-----------|--------|------------------------------------|
| blockscout-1      | 736MB         | 2GB       | 1.3GB  | Large application, needs headroom  |
| firefly-1         | 182MB         | 512MB     | 330MB  | Standard application               |
| firefly-ali-1     | 190MB         | 512MB     | 322MB  | Standard application               |
| mim-web-1         | 40MB          | 256MB     | 216MB  | Lightweight web server             |
| mim-api-1         | 90MB          | 1GB       | 910MB  | Critical container with OOM issues |
| vault-phoenix-2   | 68MB          | 512MB     | 444MB  | Vault service needs stability      |
| npmplus-secondary | 20,283MB      | 24GB      | 3.7GB  | Large application, high usage      |
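
The Buffer column is simply the new limit minus current usage. A sketch of the arithmetic (the table appears to round with 1000 MB per GB, which is an assumption inferred from its numbers):

```shell
# Buffer = new limit minus current usage, in MB (table rounds at 1000 MB/GB).
buffer_mb() { echo "$(( $1 - $2 ))"; }   # args: limit_mb usage_mb

buffer_mb 2000 736     # blockscout-1      -> 1264 (~1.3GB)
buffer_mb 1000 90      # mim-api-1         -> 910
buffer_mb 24000 20283  # npmplus-secondary -> 3717 (~3.7GB)
```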

Swap Configuration

All containers now have swap space configured to handle temporary memory spikes:

  • blockscout-1: 1GB swap
  • firefly-1, firefly-ali-1, vault-phoenix-2: 256MB swap each
  • mim-web-1: 128MB swap
  • mim-api-1: 512MB swap (critical container)
  • npmplus-secondary: 4GB swap

Verification

Current Status

All containers are:

  • Running with new memory limits
  • Restarted and operational
  • No immediate OOM kills detected

Monitoring Recommendations

  1. Monitor OOM Events:

    ssh root@192.168.11.12 'journalctl | grep -i "oom\|out of memory" | tail -20'
    
  2. Check Memory Usage:

    ./scripts/check-container-memory-limits.sh
    
  3. Watch for Patterns:

    • Monitor if containers approach their new limits
    • Adjust limits if needed based on actual usage patterns
    • Watch for any new OOM kills
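
A simple threshold check can turn "approaching their new limits" into a concrete alert. A sketch; the 80% threshold and the idea of feeding it usage/limit figures (e.g. gathered via `pct exec <vmid> -- free -m` on the host) are assumptions, not part of the existing scripts:

```shell
# Warn when a container's memory usage exceeds 80% of its limit.
# Arguments are plain numbers in MB so the check is easy to wire into cron.
warn_if_high() {
  vmid="$1" usage_mb="$2" limit_mb="$3"
  pct_used=$(( usage_mb * 100 / limit_mb ))
  if [ "$pct_used" -ge 80 ]; then
    echo "WARN: CT $vmid at ${pct_used}% of ${limit_mb}MB limit"
  else
    echo "OK: CT $vmid at ${pct_used}%"
  fi
}

warn_if_high 7811 900 1024   # mim-api-1 nearing its 1GB limit -> WARN
```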

Scripts Created

  1. scripts/check-container-memory-limits.sh

    • Check current memory limits and usage for all containers
    • Usage: ./scripts/check-container-memory-limits.sh
  2. scripts/fix-container-memory-limits.sh

    • Update memory limits for all containers
    • Usage: ./scripts/fix-container-memory-limits.sh

Next Steps

Immediate (Completed)

  • Updated all memory limits
  • Restarted all containers
  • Verified new limits are applied

Short-term

  1. Monitor for 24-48 hours:

    • Check for any new OOM kills
    • Verify containers are stable
    • Monitor memory usage patterns
  2. Fine-tune if needed:

    • Adjust limits based on actual usage
    • Optimize applications if they're using excessive memory

Long-term (Optional)

  1. Implement monitoring:

    • Set up alerts for memory usage approaching limits
    • Track memory usage trends
    • Document optimal memory allocations
  2. Optimize applications:

    • Review applications for memory leaks
    • Optimize memory usage where possible
    • Consider application-level memory limits

Summary

Status: ALL IMMEDIATE ACTIONS RESOLVED

  • Memory limits increased for all 7 containers
  • Swap space configured for all containers
  • Containers restarted with new limits
  • Critical container 7811 (mim-api-1) now has 1GB memory (up from 4MB)
  • All containers operational and stable

Expected Outcome:

  • Significant reduction in OOM kills
  • Improved container stability
  • Better application performance
  • Reduced service interruptions

Monitoring:

  • Continue monitoring logs for OOM events
  • Verify containers remain stable
  • Adjust limits if needed based on usage patterns

Resolution completed: 2026-01-19
Next review: Monitor for 24-48 hours to verify stability