http://spark1:8000/v1http://spark2:8000/v1qwen3 through the central routing layerhttp://spark3:8080spark3:4000/v1http://spark3:4000/v1gemma4 → Spark1 (vLLM)qwen3 → Spark2 (TensorRT-LLM)
http://spark3:3000/var/lib/docker/volumes/openplaud_audio/_datahttp://spark3:4000/v1