TensorSharp has been introduced—a new open-source inference engine that allows running Large Language Models (LLMs) locally using the C# language. The project supports the GGUF format and provides a suite of tools ranging from a console application to a web interface and an HTTP API compatible with OpenAI and Ollama.

image
image

What Happened

TensorSharp, an inference engine for local LLM execution in C#, has been developed. The system works on Windows, macOS, and Linux, supports GPU acceleration, and utilizes the GGUF format. The solution includes a console application, a web-based chatbot interface, and an HTTP API that supports Ollama and OpenAI protocols.

Context

Most modern tools for local neural network execution are oriented toward the Python stack. TensorSharp offers an alternative path, allowing the integration of LLM capabilities directly into the .NET ecosystem without the need for Python middleware or complex wrappers.

Why It Matters for the Industry

The emergence of a high-quality C# engine expands the toolkit for local inference and diversifies the market by providing an alternative to Python-centric solutions. This simplifies the implementation of AI features into enterprise and industrial software, where .NET is a standard and dependency on a Python environment is undesirable.

Why It Matters for Users

.NET developers can now create native desktop and server applications with local artificial intelligence using the familiar C# language. Thanks to its OpenAI/Ollama-compatible API support, transitioning to this new tool for existing projects will be as seamless as possible.

What Is Not Yet Known / Limitations

Additional verification of the engine's performance in real-world production scenarios is required.

Sources

Author

Look at AI, Editorial Team