winyunq - LiteRT-LM-Unreal

🧠 LiteRT-LM-Unreal: High-Performance Local AI Inference via Google LiteRT

Inject a true "Local AI Soul" into your Unreal Engine projects. LiteRT-LM-Unreal is a professional-grade local Agent inference solution specifically engineered for AAA-tier projects. We bypass bloated middleware and high-level abstractions to run .litertlm format models natively through the Google LiteRT architecture.

📱 Multi-Platform Support (Demo)

We are actively expanding the execution environment for local inference:

⚖️ The Dual-Track Ecosystem: GitHub vs. Fab

  • GitHub Open-Source Edition: Our core logic is always open for "Protocol Exploration" and "Architectural Validation."

  • Fab Productivity Edition (This Product): The ultimate solution for Zero-Config Commercial Deployment. We provide pre-compiled, optimized binaries that eliminate the friction of low-level compilation. You are buying back your valuable engineering time and ensuring out-of-the-box stability.

🛠️ Core Engineering Features: The "Ultra" Standard

  1. 🚀 Ultra-Fast Inference (GPU/CPU Dual Engine):

    Native GPU acceleration is supported. The system detects the hardware environment automatically; if GPU resources are restricted, it switches to an efficient CPU inference mode so that AI logic does not drop frames or block the game thread.

  2. 🧠 Ultra-Long Context (Hardware-Optimized Memory):

    By leveraging LiteRT-LM's extreme hardware efficiency, this plugin provides massive context memory within the UE5 runtime. This enables complex, long-duration Agent reasoning that traditional local solutions cannot sustain.

  3. 💾 Ultra-High Resource Utilization (Dynamic Budgeting):

    Deeply optimized for .litertlm format models. Our memory manager calculates, in real time, how much system and video memory remains after UE5's own overhead, then allocates that remaining budget precisely, maximizing performance without compromising editor stability.

  4. 👥 Ultra-Many Sessions (Weight Sharing Mechanism):

    Supports a "Multi-Agent, Single-Weight" architecture. Through advanced context switching and cache indexing, multiple Agents can share the same model weights simultaneously. As long as memory permits, you can deploy an extensive array of local Agents.

  5. 🛡️ Zero-Friction Deployment (No Environment Required):

    The ultimate plug-and-play solution. You do NOT need to install or configure any complex environments—such as Bazel, Rust, Python, or specialized compilers—on your machine during development or at runtime. Everything is encapsulated within the binary for immediate deployment.
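
To make features 1, 3 and 4 concrete, here is a minimal standalone C++ sketch of the ideas they describe: choosing a backend, budgeting memory around the model weights, and letting several agents share one set of weights. Every name in it (`MemorySnapshot`, `ChooseBackend`, `MaxSessions`, `ModelWeights`, `AgentSession`) is hypothetical and is not the plugin's actual API. The fallback rule (use the GPU only when the weights fit in free video memory) and the fixed per-agent context size are also assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// All names below are hypothetical illustrations, not the plugin's real API.
enum class Backend { Gpu, Cpu };

struct MemorySnapshot {
    std::uint64_t freeVideoBytes;   // video memory left after the engine's own usage
    std::uint64_t freeSystemBytes;  // system memory left after the engine's own usage
};

// Assumed rule: use the GPU only if the weights fit in free video memory,
// otherwise fall back to the CPU.
Backend ChooseBackend(const MemorySnapshot& mem, std::uint64_t weightBytes) {
    return mem.freeVideoBytes >= weightBytes ? Backend::Gpu : Backend::Cpu;
}

// Number of per-agent contexts that fit in the chosen memory pool once the
// shared weights have been loaded a single time.
std::uint64_t MaxSessions(const MemorySnapshot& mem, Backend backend,
                          std::uint64_t weightBytes, std::uint64_t contextBytes) {
    assert(contextBytes > 0);
    const std::uint64_t pool =
        backend == Backend::Gpu ? mem.freeVideoBytes : mem.freeSystemBytes;
    if (pool < weightBytes) {
        return 0;  // not even the weights fit
    }
    return (pool - weightBytes) / contextBytes;
}

// Loaded once and treated as read-only by every session.
struct ModelWeights {
    std::string path;
    std::uint64_t bytes;
};

// Each agent owns its own conversation state but only references the weights.
class AgentSession {
public:
    AgentSession(std::shared_ptr<const ModelWeights> weights, std::string name)
        : weights_(std::move(weights)), name_(std::move(name)) {}

    void Append(const std::string& text) { history_.push_back(text); }

    const ModelWeights* Weights() const { return weights_.get(); }
    const std::string& Name() const { return name_; }
    std::size_t HistorySize() const { return history_.size(); }

private:
    std::shared_ptr<const ModelWeights> weights_;  // shared, never copied
    std::string name_;
    std::vector<std::string> history_;             // private to this agent
};
```

Under these assumptions, a machine with 4 GiB of free video memory and a 2 GiB model would select the GPU and have room for four 512 MiB agent contexts. Two `AgentSession` objects built from the same `std::shared_ptr` point at one `ModelWeights` instance while keeping separate histories, which is the "Multi-Agent, Single-Weight" idea in miniature.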

🔗 Essential Resources

Before purchase or integration, please review our technical derivation and documentation:

🎁 Early Access: Free Implementation Phase

Our documentation and tutorials are currently being refined to ensure a premium user experience. During this "Free Week" period, we invite you to explore the full potential of LiteRT-LM-Unreal at zero cost:

  • The Deal: Access the plugin for free now, join our Discord Community, and download the standalone Demo to verify performance.

  • Our Request: If this architecture solves your local AI needs, we would greatly appreciate a 5-star review to support the project's growth.

  • Report an Issue on GitHub: If something isn't working as expected, please open an issue. We'll prioritize fixes for the community. If you find this project valuable, please consider giving us a Star on GitHub.

  • Future Change: Please note that in 30 days (before May 30), once the documentation reaches version 1.0, the price will be adjusted to $19.99.

Grab it now and build the future of local AI together!