Question 1

Why choose Cluaiz?

Accepted Answer

Cluaiz is built for developers who demand absolute performance and privacy. It provides Native Execution Speed (bypassing middleware), Low-Latency Execution via FFI, Hardware Agnostic support for NVIDIA/Apple/AMD/ARM, 100% local privacy, and a future-proof architecture optimized for GGUF and beyond.

Question 2

Is Cluaiz an AI Model?

Accepted Answer

No. Cluaiz is the engine/infrastructure that runs the models. Think of it like an operating system for AI. We support state-of-the-art architectures like Transformers, Mixture-of-Experts (MoE), and BitNet b1.58 ternary models.

Question 3

What hardware is supported?

Accepted Answer

Cluaiz extracts maximum performance from almost any silicon. We support NVIDIA (CUDA Tensor Cores), Apple Silicon (Metal/MPS), AMD/Intel (Vulkan/ROCm), and even ARM processors like Raspberry Pi.

Question 4

Is it completely free and open-source?

Accepted Answer

Cluaiz is released under the Apache License 2.0. It is completely free for personal use, individuals, startups, and enterprise builders to maintain a decentralized, open-source technology framework.

Question 5

How does it achieve sub-microsecond latency?

Accepted Answer

Unlike traditional wrappers that communicate via HTTP or heavy API layers, Cluaiz uses Shared-Memory Signaling and Native FFI (Foreign Function Interfaces) to talk directly to the inference kernels. This cuts out all middleman overhead.

Question 6

Is Cluaiz fully stable and ready for production?

Accepted Answer

Cluaiz is currently in its Alpha phase of development. While it is highly capable and you can use it effectively right now, you might encounter occasional bugs as we rapidly innovate. We are constantly pushing updates to achieve full stability soon.

Question 7

Can I run custom GGUF models not listed in the Cluaiz Registry?

Accepted Answer

Yes, absolutely! You can run any standard GGUF file by simply providing its Hugging Face URL. However, because the engine is still in active development, we highly recommend using the officially supported models from our Registry first for the most stable and optimized experience.

Question 8

Which model formats does Cluaiz support?

Accepted Answer

Currently, Cluaiz exclusively provides full, native support for the GGUF format. Formats like AWQ, GPTQ, and others are not supported at this time. In the future, we will add support for AWQ (featuring concurrent 4-bit and 8-bit architecture support). However, GPTQ will not be supported.

Question 9

Does Cluaiz provide an API for external tools?

Accepted Answer

Yes, Cluaiz includes a built-in API. Currently, it offers basic compatibility so you can already connect with external chatbots and ecosystem tools. It is a work-in-progress, but full, comprehensive API support is coming very soon.

Question 10

Since Cluaiz uses FFI to bypass APIs for speed, won't providing an API make it slow?

Accepted Answer

No, the core engine speed remains completely unaffected. The internal runtime always executes via ultra-fast Native FFI and Shared-Memory. The API we provide is strictly an optional external compatibility layer for ecosystem tools. The actual inference and processing still happen at native hardware speeds.

Question 11

Is Cluaiz strictly for local desktops/laptops, or can it be deployed on production servers and cloud VPS?

Accepted Answer

Cluaiz scales seamlessly from 8GB laptops to multi-GPU enterprise servers. For servers, it offers Native Execution (running directly on any VPS with zero dependencies) and operates as a Pure C/C++ & Rust Daemon exposing a highly scalable API server. While the current focus is on desktop and server stability, mobile ecosystem support for native on-device execution is planned for the future.

Frequently Asked Questions

Still have questions?

Frequently Asked Questions

Still have questions?