“Latency–Throughput Tradeoffs of ONNX Runtime, TensorRT-LLM, VLLM, and Triton: An Empirical Comparison on 1B–3B Parameter LLM Inference.” Journal of Global Engineering Review, vol. 4, no. 1, Feb. 2026, pp. 173-82, https://doi.org/10.66372/JGER.v4i1.12.