POST /chat/intelligent
Streaming inference through the private control plane. Use model: auto when routing is enabled.
Developer docs
A practical first-pass reference for teams evaluating Gridlight: API shape, deployment patterns, security posture, and what to validate during a technical briefing.
Unified API
Applications should not hardcode model vendors or hardware paths. They call Gridlight, and the control plane handles model availability, policy, capacity, routing, and lineage.
const response = await fetch('https://your-gridlight-instance/chat/intelligent', {
method: 'POST',
headers: {
Authorization: `Bearer ${token}`,
'Content-Type': 'application/json'
},
body: JSON.stringify({
model: 'auto',
policy: 'private-default',
messages: [{ role: 'user', content: prompt }],
stream: true
})
});/chat/intelligentStreaming inference through the private control plane. Use model: auto when routing is enabled.
/neonSynchronous JSON response for application workflows that do not need streaming.
/modelsList available models, versions, classifications, and routing policies.
/capacityRead TFLOP allocation, queue depth, and worker-node telemetry.
/audit/logsExport inference lineage and policy decisions to your SIEM or audit store.
/policyUpdate governed routing, data classification, and access policies.
Security posture
Data gravity
Deployment topologies
Start with one data-center deployment for internal apps, local RAG, and governed inference.
Run inference close to plants, clinics, branches, or robots while maintaining centralized policy.
Use the same API and governance model inside controlled private-cloud environments.
Keep application architecture stable while selectively allowing cloud AI as a fallback or preferred route.
Technical briefing