SGLang
SGLang (short for Structured Generation Language) is an open-source framework for programming and serving large language models and multimodal models. It was introduced by researchers affiliated with LMSYS and other institutions as a system combining a Python-embedded language for structured generation with a runtime for high-throughput inference.