- Organized 252 files across project - Root directory: 187 → 2 files (98.9% reduction) - Moved configuration guides to docs/04-configuration/ - Moved troubleshooting guides to docs/09-troubleshooting/ - Moved quick start guides to docs/01-getting-started/ - Moved reports to reports/ directory - Archived temporary files - Generated comprehensive reports and documentation - Created maintenance scripts and guides All files organized according to established standards.
484 lines
14 KiB
Markdown
484 lines
14 KiB
Markdown
# Proxmox VE Comprehensive Configuration Review
|
|
|
|
**Last Updated:** 2025-01-20
|
|
**Document Version:** 1.0
|
|
**Status:** Active Documentation
|
|
|
|
---
|
|
|
|
## Executive Summary
|
|
|
|
### ✅ Completed Tasks
|
|
- [x] Hostname migration (pve → r630-01, pve2 → r630-02)
|
|
- [x] IP address audit (no conflicts found)
|
|
- [x] Proxmox services verified (all operational)
|
|
- [x] Storage configuration reviewed
|
|
|
|
### ⚠️ Issues Identified
|
|
- r630-01 and r630-02 have LVM thin storage **disabled**
|
|
- All VMs/containers currently on ml110 only
|
|
- Storage not optimized for performance on r630-01/r630-02
|
|
|
|
---
|
|
|
|
## Hostname Migration - COMPLETE ✅
|
|
|
|
### Status
|
|
- **r630-01** (192.168.11.11): ✅ Hostname changed from `pve` to `r630-01`
|
|
- **r630-02** (192.168.11.12): ✅ Hostname changed from `pve2` to `r630-02`
|
|
|
|
### Verification
|
|
```bash
|
|
ssh root@192.168.11.11 "hostname" # Returns: r630-01 ✅
|
|
ssh root@192.168.11.12 "hostname" # Returns: r630-02 ✅
|
|
```
|
|
|
|
### Notes
|
|
- Both hosts are in a cluster (cluster name: "h")
|
|
- Cluster configuration may need update to reflect new hostnames
|
|
- /etc/hosts updated on both hosts for proper resolution
|
|
|
|
---
|
|
|
|
## IP Address Audit - COMPLETE ✅
|
|
|
|
### Results
|
|
- **Total VMs/Containers:** 34 with static IPs
|
|
- **IP Conflicts:** 0 ✅
|
|
- **Invalid IPs:** 0 ✅
|
|
- **DHCP IPs:** 2 (VMIDs 3500, 3501)
|
|
|
|
### All VMs Currently On
|
|
- **ml110** (192.168.11.10): All 34 VMs/containers
|
|
- **r630-01** (192.168.11.11): 0 VMs/containers
|
|
- **r630-02** (192.168.11.12): 0 VMs/containers
|
|
|
|
### IP Allocation Summary
|
|
| IP Range | Count | Purpose |
|
|
|----------|-------|---------|
|
|
| 192.168.11.57 | 1 | Firefly (stopped) |
|
|
| 192.168.11.60-63 | 4 | ML nodes |
|
|
| 192.168.11.64 | 1 | Indy |
|
|
| 192.168.11.80 | 1 | Cacti |
|
|
| 192.168.11.100-104 | 5 | Besu Validators |
|
|
| 192.168.11.105-106 | 2 | DBIS PostgreSQL |
|
|
| 192.168.11.112 | 1 | Fabric |
|
|
| 192.168.11.120 | 1 | DBIS Redis |
|
|
| 192.168.11.130 | 1 | DBIS Frontend |
|
|
| 192.168.11.150-154 | 5 | Besu Sentries |
|
|
| 192.168.11.155-156 | 2 | DBIS API |
|
|
| 192.168.11.201-204 | 4 | Named RPC |
|
|
| 192.168.11.240-242 | 3 | ThirdWeb RPC |
|
|
| 192.168.11.250-254 | 5 | Public RPC |
|
|
|
|
---
|
|
|
|
## Proxmox Host Configuration Review
|
|
|
|
### ml110 (192.168.11.10)
|
|
|
|
| Property | Value | Status |
|
|
|----------|-------|--------|
|
|
| **Hostname** | ml110 | ✅ Correct |
|
|
| **Proxmox Version** | 9.1.0 (kernel 6.17.4-1-pve) | ✅ Current |
|
|
| **CPU** | Intel Xeon E5-2603 v3 @ 1.60GHz (6 cores) | ⚠️ Older, slower |
|
|
| **Memory** | 125GB total, 94GB used, 31GB available | ⚠️ High usage |
|
|
| **Storage - local** | 94GB total, 7.4GB used (7.87%) | ✅ Good |
|
|
| **Storage - local-lvm** | 813GB total, 214GB used (26.29%) | ✅ Active |
|
|
| **VMs/Containers** | 34 total | ✅ All here |
|
|
|
|
**Storage Details:**
|
|
- `local`: Directory storage, active, 94GB available
|
|
- `local-lvm`: LVM thin, active, 600GB available
|
|
- `thin1-thin6`: Configured but disabled (not in use)
|
|
|
|
**Recommendations:**
|
|
- ⚠️ **CPU is older/slower** - Consider workload distribution
|
|
- ⚠️ **Memory usage high (75%)** - Monitor closely
|
|
- ✅ **Storage well configured** - LVM thin active and working
|
|
|
|
### r630-01 (192.168.11.11) - Previously "pve"
|
|
|
|
| Property | Value | Status |
|
|
|----------|-------|--------|
|
|
| **Hostname** | r630-01 | ✅ Migrated |
|
|
| **Proxmox Version** | 9.1.0 (kernel 6.17.4-1-pve) | ✅ Current |
|
|
| **CPU** | Intel Xeon E5-2630 v3 @ 2.40GHz (32 cores) | ✅ Good |
|
|
| **Memory** | 503GB total, 6.4GB used, 497GB available | ✅ Excellent |
|
|
| **Storage - local** | 536GB total, 0.1GB used (0.00%) | ✅ Available |
|
|
| **Storage - local-lvm** | **DISABLED** | ⚠️ **Issue** |
|
|
| **Storage - thin1-thin6** | **DISABLED** | ⚠️ **Issue** |
|
|
| **VMs/Containers** | 0 | ⏳ Ready for deployment |
|
|
|
|
**Storage Details:**
|
|
- **Volume Group:** `pve` exists with 2 physical volumes
|
|
- **Thin Pools:** `data` (200GB) and `thin1` (208GB) exist
|
|
- **Disks:** 4 disks (sda, sdb: 558GB each; sdc, sdd: 232GB each)
|
|
- **LVM Setup:** Properly configured
|
|
- **Storage Config Issue:** Storage configured but node references point to "pve" (old hostname) or "pve2"
|
|
|
|
**Issues:**
|
|
- ⚠️ **Storage configured but node references outdated** - Points to "pve" instead of "r630-01"
|
|
- ⚠️ **Storage may show as disabled** - Due to hostname mismatch in config
|
|
- ⚠️ **Need to update storage.cfg** - Update node references to r630-01
|
|
|
|
**Recommendations:**
|
|
- 🔴 **CRITICAL:** Enable local-lvm storage to use existing LVM thin pools
|
|
- 🔴 **CRITICAL:** Activate thin1 storage for better performance
|
|
- ✅ **Ready for VMs** - Excellent resources available
|
|
|
|
### r630-02 (192.168.11.12) - Previously "pve2"
|
|
|
|
| Property | Value | Status |
|
|
|----------|-------|--------|
|
|
| **Hostname** | r630-02 | ✅ Migrated |
|
|
| **Proxmox Version** | 9.1.0 (kernel 6.17.4-1-pve) | ✅ Current |
|
|
| **CPU** | Intel Xeon E5-2660 v4 @ 2.00GHz (56 cores) | ✅ Excellent |
|
|
| **Memory** | 251GB total, 4.4GB used, 247GB available | ✅ Excellent |
|
|
| **Storage - local** | 220GB total, 0.1GB used (0.06%) | ✅ Available |
|
|
| **Storage - local-lvm** | **DISABLED** | ⚠️ **Issue** |
|
|
| **Storage - thin1-thin6** | **DISABLED** | ⚠️ **Issue** |
|
|
| **VMs/Containers** | 0 | ⏳ Ready for deployment |
|
|
|
|
**Storage Details:**
|
|
- Need to check LVM configuration (command timed out)
|
|
- Storage shows as disabled in Proxmox
|
|
|
|
**Issues:**
|
|
- ⚠️ **Storage configured but node references outdated** - Points to "pve2" instead of "r630-02"
|
|
- ⚠️ **VMs already exist on storage** - Need to verify they're accessible
|
|
- ⚠️ **Need to update storage.cfg** - Update node references to r630-02
|
|
|
|
**Recommendations:**
|
|
- 🔴 **CRITICAL:** Check and configure LVM storage
|
|
- 🔴 **CRITICAL:** Enable local-lvm or thin storage
|
|
- ✅ **Ready for VMs** - Excellent resources available
|
|
|
|
---
|
|
|
|
## Storage Configuration Analysis
|
|
|
|
### Current Storage Status
|
|
|
|
| Host | Storage Type | Status | Size | Usage | Recommendation |
|
|
|------|--------------|--------|------|-------|----------------|
|
|
| **ml110** | local | ✅ Active | 94GB | 7.87% | ✅ Good |
|
|
| **ml110** | local-lvm | ✅ Active | 813GB | 26.29% | ✅ Good |
|
|
| **r630-01** | local | ✅ Active | 536GB | 0.00% | ✅ Ready |
|
|
| **r630-01** | local-lvm | ❌ Disabled | 0GB | N/A | 🔴 **Enable** |
|
|
| **r630-01** | thin1 | ❌ Disabled | 0GB | N/A | 🔴 **Enable** |
|
|
| **r630-02** | local | ✅ Active | 220GB | 0.06% | ✅ Ready |
|
|
| **r630-02** | local-lvm | ❌ Disabled | 0GB | N/A | 🔴 **Enable** |
|
|
| **r630-02** | thin1-thin6 | ❌ Disabled | 0GB | N/A | 🔴 **Enable** |
|
|
|
|
### Storage Issues
|
|
|
|
#### r630-01 Storage Issue
|
|
**Problem:** LVM thin pools exist (`data` 200GB, `thin1` 208GB) but Proxmox storage is disabled
|
|
|
|
**Root Cause:** Storage configured in Proxmox but not activated/enabled
|
|
|
|
**Solution:**
|
|
```bash
|
|
# Update storage.cfg node references on r630-01
|
|
ssh root@192.168.11.11
|
|
# Update node references from "pve" to "r630-01"
|
|
sed -i 's/nodes pve$/nodes r630-01/' /etc/pve/storage.cfg
|
|
sed -i 's/nodes pve /nodes r630-01 /' /etc/pve/storage.cfg
|
|
# Enable storage
|
|
pvesm set local-lvm --disable 0 2>/dev/null || true
|
|
pvesm set thin1 --disable 0 2>/dev/null || true
|
|
```
|
|
|
|
#### r630-02 Storage Issue
|
|
**Problem:** Storage disabled, LVM configuration unknown
|
|
|
|
**Solution:**
|
|
```bash
|
|
# Update storage.cfg node references on r630-02
|
|
ssh root@192.168.11.12
|
|
# Update node references from "pve2" to "r630-02"
|
|
sed -i 's/nodes pve2$/nodes r630-02/' /etc/pve/storage.cfg
|
|
sed -i 's/nodes pve2 /nodes r630-02 /' /etc/pve/storage.cfg
|
|
# Enable all thin storage pools
|
|
for storage in thin1 thin2 thin3 thin4 thin5 thin6; do
|
|
pvesm set "$storage" --disable 0 2>/dev/null || true
|
|
done
|
|
```
|
|
|
|
---
|
|
|
|
## Critical Recommendations
|
|
|
|
### 1. Enable LVM Thin Storage on r630-01 and r630-02 🔴 CRITICAL
|
|
|
|
**Priority:** HIGH
|
|
**Impact:** Cannot migrate VMs or create new VMs with optimal storage
|
|
|
|
**Action Required:**
|
|
1. Enable `local-lvm` storage on both hosts
|
|
2. Activate `thin1` storage pools if they exist
|
|
3. Verify storage is accessible and working
|
|
|
|
**Script Available:** `scripts/enable-local-lvm-storage.sh` (may need updates)
|
|
|
|
### 2. Distribute VMs Across Hosts ⚠️ RECOMMENDED
|
|
|
|
**Current State:** All 34 VMs on ml110 (overloaded)
|
|
|
|
**Recommendation:**
|
|
- Migrate some VMs to r630-01 and r630-02
|
|
- Balance workload across all three hosts
|
|
- Use r630-01/r630-02 for new deployments
|
|
|
|
**Benefits:**
|
|
- Better resource utilization
|
|
- Improved performance (ml110 CPU is slower)
|
|
- Better redundancy
|
|
|
|
### 3. Update Cluster Configuration ⚠️ RECOMMENDED
|
|
|
|
**Issue:** Hostnames changed but cluster may still reference old names
|
|
|
|
**Action:**
|
|
```bash
|
|
# Check cluster configuration
|
|
pvecm status
|
|
pvecm nodes
|
|
|
|
# Update if needed (may require cluster reconfiguration)
|
|
```
|
|
|
|
### 4. Storage Performance Optimization ⚠️ RECOMMENDED
|
|
|
|
**Current:**
|
|
- ml110: Using local-lvm (good)
|
|
- r630-01: Only local (directory) available (slower)
|
|
- r630-02: Only local (directory) available (slower)
|
|
|
|
**Recommendation:**
|
|
- Enable LVM thin storage on r630-01/r630-02 for better performance
|
|
- Use thin provisioning for space efficiency
|
|
- Monitor storage usage
|
|
|
|
### 5. Resource Monitoring ⚠️ RECOMMENDED
|
|
|
|
**ml110:**
|
|
- Memory usage: 75% (high) - Monitor closely
|
|
- CPU: Older/slower - Consider workload reduction
|
|
|
|
**r630-01/r630-02:**
|
|
- Excellent resources available
|
|
- Ready for heavy workloads
|
|
|
|
---
|
|
|
|
## Detailed Recommendations by Category
|
|
|
|
### Storage Recommendations
|
|
|
|
#### Immediate Actions
|
|
1. **Enable local-lvm on r630-01**
|
|
- LVM thin pools already exist
|
|
- Just need to activate in Proxmox
|
|
- Will enable efficient storage for VMs
|
|
|
|
2. **Configure storage on r630-02**
|
|
- Check LVM configuration
|
|
- Enable appropriate storage type
|
|
- Ensure compatibility with cluster
|
|
|
|
3. **Verify storage after enabling**
|
|
- Test VM creation
|
|
- Test storage migration
|
|
- Monitor performance
|
|
|
|
#### Long-term Actions
|
|
1. **Implement storage monitoring**
|
|
- Set up alerts for storage usage >80%
|
|
- Monitor thin pool usage
|
|
- Track storage growth trends
|
|
|
|
2. **Consider shared storage**
|
|
- For easier VM migration
|
|
- For better redundancy
|
|
- NFS or Ceph options
|
|
|
|
### Network Recommendations
|
|
|
|
#### Current Status
|
|
- All hosts on 192.168.11.0/24 network
|
|
- Flat network (no VLANs yet)
|
|
- Gateway: 192.168.11.1 (ER605-1)
|
|
|
|
#### Recommendations
|
|
1. **VLAN Migration** (Planned)
|
|
- Segment network by service type
|
|
- Improve security and isolation
|
|
- Better traffic management
|
|
|
|
2. **Network Monitoring**
|
|
- Monitor bandwidth usage
|
|
- Track network performance
|
|
- Alert on network issues
|
|
|
|
### Cluster Recommendations
|
|
|
|
#### Current Status
|
|
- Cluster name: "h"
|
|
- 3 nodes: ml110, r630-01, r630-02
|
|
- Cluster operational
|
|
|
|
#### Recommendations
|
|
1. **Update Cluster Configuration**
|
|
- Verify hostname changes reflected in cluster
|
|
- Update any references to old hostnames
|
|
- Test cluster operations
|
|
|
|
2. **Cluster Quorum**
|
|
- Ensure quorum is maintained
|
|
- Monitor cluster health
|
|
- Document cluster procedures
|
|
|
|
### Performance Recommendations
|
|
|
|
#### ml110
|
|
- **CPU:** Older/slower - Consider reducing workload
|
|
- **Memory:** High usage - Monitor and optimize
|
|
- **Storage:** Well configured - No changes needed
|
|
|
|
#### r630-01
|
|
- **CPU:** Good performance - Ready for workloads
|
|
- **Memory:** Excellent - Can handle many VMs
|
|
- **Storage:** Needs activation - Critical fix needed
|
|
|
|
#### r630-02
|
|
- **CPU:** Excellent (56 cores) - Best performance
|
|
- **Memory:** Excellent - Can handle many VMs
|
|
- **Storage:** Needs configuration - Critical fix needed
|
|
|
|
---
|
|
|
|
## Action Items
|
|
|
|
### Critical (Do Before Starting VMs)
|
|
|
|
1. ✅ **Hostname Migration** - COMPLETE
|
|
2. ✅ **IP Address Audit** - COMPLETE
|
|
3. 🔴 **Enable local-lvm storage on r630-01** - PENDING
|
|
4. 🔴 **Configure storage on r630-02** - PENDING
|
|
5. ⚠️ **Verify cluster configuration** - PENDING
|
|
|
|
### High Priority
|
|
|
|
1. ⚠️ **Test VM creation on r630-01/r630-02** - After storage enabled
|
|
2. ⚠️ **Update cluster configuration** - Verify hostname changes
|
|
3. ⚠️ **Plan VM distribution** - Balance workload across hosts
|
|
|
|
### Medium Priority
|
|
|
|
1. ⚠️ **Implement storage monitoring** - Set up alerts
|
|
2. ⚠️ **Document storage procedures** - For future reference
|
|
3. ⚠️ **Plan VLAN migration** - Network segmentation
|
|
|
|
---
|
|
|
|
## Verification Checklist
|
|
|
|
### Hostname Verification
|
|
- [x] r630-01 hostname correct
|
|
- [x] r630-02 hostname correct
|
|
- [x] /etc/hosts updated on both hosts
|
|
- [ ] Cluster configuration updated (if needed)
|
|
|
|
### IP Address Verification
|
|
- [x] No conflicts detected
|
|
- [x] No invalid IPs
|
|
- [x] All IPs documented
|
|
- [x] IP audit script working
|
|
|
|
### Storage Verification
|
|
- [x] ml110 storage working
|
|
- [ ] r630-01 local-lvm enabled
|
|
- [ ] r630-02 storage configured
|
|
- [ ] Storage tested and working
|
|
|
|
### Service Verification
|
|
- [x] All Proxmox services running
|
|
- [x] Web interfaces accessible
|
|
- [x] Cluster operational
|
|
- [ ] Storage accessible
|
|
|
|
---
|
|
|
|
## Next Steps
|
|
|
|
### Immediate (Before Starting VMs)
|
|
|
|
1. **Enable Storage on r630-01:**
|
|
```bash
|
|
ssh root@192.168.11.11
|
|
# Check current storage config
|
|
cat /etc/pve/storage.cfg
|
|
# Enable local-lvm
|
|
pvesm set local-lvm --disable 0
|
|
# Or reconfigure if needed
|
|
```
|
|
|
|
2. **Configure Storage on r630-02:**
|
|
```bash
|
|
ssh root@192.168.11.12
|
|
# Check LVM setup
|
|
vgs
|
|
lvs
|
|
# Configure appropriate storage
|
|
```
|
|
|
|
3. **Verify Storage:**
|
|
```bash
|
|
# On each host
|
|
pvesm status
|
|
# Should show local-lvm as active
|
|
```
|
|
|
|
### After Storage is Enabled
|
|
|
|
1. **Test VM Creation:**
|
|
- Create test container on r630-01
|
|
- Create test container on r630-02
|
|
- Verify storage works correctly
|
|
|
|
2. **Start VMs:**
|
|
- All IPs verified, no conflicts
|
|
- Hostnames correct
|
|
- Storage ready
|
|
|
|
---
|
|
|
|
## Scripts Available
|
|
|
|
1. **`scripts/check-all-vm-ips.sh`** - ✅ Working - IP audit
|
|
2. **`scripts/migrate-hostnames-proxmox.sh`** - ✅ Complete - Hostname migration
|
|
3. **`scripts/diagnose-proxmox-hosts.sh`** - ✅ Working - Diagnostics
|
|
4. **`scripts/enable-local-lvm-storage.sh`** - ⏳ May need updates for r630-01/r630-02
|
|
|
|
---
|
|
|
|
## Related Documentation
|
|
|
|
### Architecture Documents
|
|
- **[PHYSICAL_HARDWARE_INVENTORY.md](PHYSICAL_HARDWARE_INVENTORY.md)** ⭐⭐⭐ - Physical hardware inventory
|
|
- **[NETWORK_ARCHITECTURE.md](NETWORK_ARCHITECTURE.md)** ⭐⭐⭐ - Network architecture
|
|
- **[ORCHESTRATION_DEPLOYMENT_GUIDE.md](ORCHESTRATION_DEPLOYMENT_GUIDE.md)** ⭐⭐⭐ - Deployment orchestration
|
|
|
|
### Deployment Documents
|
|
- **[../03-deployment/PRE_START_CHECKLIST.md](../03-deployment/PRE_START_CHECKLIST.md)** - Pre-start checklist
|
|
- **[../03-deployment/LVM_THIN_PVE_ENABLED.md](../03-deployment/LVM_THIN_PVE_ENABLED.md)** - LVM thin storage setup
|
|
- **[../09-troubleshooting/STORAGE_MIGRATION_ISSUE.md](../09-troubleshooting/STORAGE_MIGRATION_ISSUE.md)** - Storage migration troubleshooting
|
|
|
|
---
|
|
|
|
**Last Updated:** 2025-01-20
|
|
**Document Version:** 1.0
|
|
**Review Cycle:** Quarterly
|