STACKIT Model Serving is a scalable, token-based service for the use of current open source large language models (LLMs). Using them via API allows easy integration into your own chatbots, RAG solutions and the generation of creative texts.