LMX Setup
This guide covers installing and configuring Opta LMX on your Apple Silicon machine. LMX is designed to run on a dedicated inference server — typically a dedicated Apple Silicon host on your local network.
Hardware Requirements
LMX requires Apple Silicon with sufficient unified memory to load your target models. The minimum and recommended configurations are:
Recommended Configurations
| Tier | Chip | Memory | Model Range |
|---|---|---|---|
| Minimum | M1 Pro / M2 Pro | 32GB | 7B - 14B models |
| Recommended | M2 Ultra / M3 Ultra | 64GB - 128GB | 30B - 70B models |
| Ideal | M3 Ultra | 192GB | 70B+ models, multiple concurrent |
Python Environment
LMX requires Python 3.12 or later. Use a virtual environment to isolate dependencies:
Installation
pip Install
Key dependencies installed:
mlx/mlx-lm— Apple MLX framework and model utilitiesfastapi+uvicorn— HTTP servertransformers— Tokenizer supporthuggingface-hub— Model downloading
Configuration
LMX reads configuration from environment variables and an optional config file. The primary settings are:
Starting LMX
Activate the virtual environment
Start the server
Verify the server is responding
launchd Service
For production use, run LMX as a launchd service so it starts automatically on boot and restarts on crash.
Verification
Run these checks from your MacBook to confirm LMX is accessible over the LAN: