This chapter is for contributors and maintainers.

System Overview

NeuralDrive is a specialized Linux distribution designed to function as a headless LLM appliance. It prioritizes reliability, security, and ease of use by abstracting the complexities of GPU drivers and model orchestration.

Runtime Stack

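The system follows a layered architecture. The diagram below reads top-down: user interfaces at the top, then the proxy and application services, then the inference engine, and finally the GPU drivers and base operating system.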

+-------------------------------------------------------+
|                    Web Browser (UI)                   |
+-------------------------------------------------------+
                           | (HTTPS)
+-------------------------------------------------------+
|                     Caddy Proxy                       |
|   (TLS, Routing, Authentication, Rate Limiting)       |
+-----------+---------------+-----------+---------------+
            |               |           |
+-----------v-----------+   |   +-------v-------+   +---|---+
|      Open WebUI       |   |   |   System API  |   |  TUI  |
| (Frontend Application)|   |   |   (FastAPI)   |   | (TTY) |
+-----------+-----------+   |   +-------+-------+   +---|---+
            |               |           |               |
+-----------v---------------v-----------v---------------v-------+
|                           Ollama                             |
|              (Inference Engine & Model Manager)              |
+-------------------------------+-------------------------------+
                                |
+-------------------------------v-------------------------------+
|                      GPU Hardware / Drivers                  |
|               (NVIDIA CUDA, AMD ROCm, Intel oneAPI)          |
+---------------------------------------------------------------+
|                        Debian 12 Base                        |
+---------------------------------------------------------------+

Component Roles

Caddy Proxy

Caddy acts as the secure gateway for the entire appliance. It terminates TLS using self-signed or ACME-provided certificates, routes traffic to the appropriate backend service based on the URL path, and enforces Bearer token authentication for API requests.
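From a client's point of view, the Bearer token requirement means every API request carries an Authorization header. The following is a minimal sketch using only the Python standard library. The hostname neuraldrive.local, the /v1/models path, and the token value are illustrative assumptions, not values defined by this guide.

```python
import urllib.request

# Hypothetical values for illustration; substitute your appliance's
# address and the token configured on it.
BASE_URL = "https://neuraldrive.local"
API_TOKEN = "example-token"


def build_api_request(path: str, token: str = API_TOKEN) -> urllib.request.Request:
    """Build (but do not send) an HTTPS request carrying the Bearer token."""
    return urllib.request.Request(
        BASE_URL + path,
        headers={"Authorization": f"Bearer {token}"},
    )


# Assumed path under the /v1/ inference prefix.
request = build_api_request("/v1/models")
print(request.full_url)
print(request.get_header("Authorization"))
```

To actually send the request, pass it to urllib.request.urlopen. When the appliance uses a self-signed certificate, the client also needs an ssl context that trusts that certificate.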

Data Flow

1. User Request: An HTTPS request arrives at Caddy on port 443.
2. Routing: Caddy determines whether the request is for the WebUI (/), the inference API (/v1/), or the System API (/system/).
3. Authentication: If the request is for an API endpoint, Caddy verifies the Bearer token.
4. Backend Processing: Caddy proxies the request to the relevant local service (e.g., localhost:11434 for Ollama).
5. Response: The backend service returns data to Caddy, which passes it back to the user over the encrypted connection.
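The routing and authentication steps above can be sketched as a small decision function. This is an illustration of the logic only, not Caddy's actual configuration or implementation. Only the Ollama address comes from this guide; the Open WebUI and System API addresses and the token value are hypothetical.

```python
import hmac
from typing import Optional

# Only the Ollama address appears in this guide; the other two are assumed.
BACKENDS = {
    "webui": "localhost:8080",    # hypothetical Open WebUI address
    "ollama": "localhost:11434",  # from the guide
    "system": "localhost:8000",   # hypothetical System API address
}
EXPECTED_TOKEN = "example-token"  # placeholder secret for illustration


def has_valid_token(authorization: Optional[str]) -> bool:
    """Check for an 'Authorization: Bearer <token>' value with the expected token."""
    scheme, _, token = (authorization or "").partition(" ")
    # compare_digest avoids leaking the token through timing differences.
    return scheme == "Bearer" and hmac.compare_digest(
        token.encode(), EXPECTED_TOKEN.encode()
    )


def route(path: str, authorization: Optional[str] = None) -> Optional[str]:
    """Return the backend address for a request, or None if it is rejected."""
    # Routing: pick the service by URL path prefix.
    if path.startswith("/v1/"):
        service = "ollama"
    elif path.startswith("/system/"):
        service = "system"
    else:
        service = "webui"

    # Authentication: only API endpoints require the Bearer token.
    if service != "webui" and not has_valid_token(authorization):
        return None

    # Backend processing: the request would be proxied to this address.
    return BACKENDS[service]


print(route("/"))
print(route("/v1/chat/completions"))
print(route("/v1/chat/completions", "Bearer example-token"))
```

A return value of None stands in for the proxy rejecting the request instead of forwarding it; the exact error response Caddy sends is not specified here.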
