Skip to Content
We are live but in Staging 🎉
ModelsAPI ReferenceOverview

Models — API Reference

Package: dodil.ignite.v1 · Service: ModelService

OpenAI/Cohere-compatible inference over a global model catalog. The HTTP surface is built to drop in behind an existing OpenAI or Cohere client: the paths are top-level OpenAI-style (/v1/chat/completions, /v1/embeddings, /v1/rerank, /v1/audio/transcriptions, /v1/models) — not under /v1/ignite/ — and the JSON body matches the OpenAI/Cohere request exactly (max_tokens, top_p, top_n, response_format, …). Because the body is OpenAI/Cohere-native snake_case, the same JSON works over both HTTP and gRPC.

gRPC reaches every method at dodil.ignite.v1.ModelService/<Method> on $IGNITE_GRPC; the HTTP gateway mirrors each one. See Conventions → Using gRPC for grpcurl setup. HTTP base URL: https://api.dev.dodil.io. Auth is Authorization: Bearer $DODIL_TOKEN — except ListModels and GetModel, which are public (no auth).

A model’s declared input.format (from GetModel) tells you which RPC to call.

Sections

SectionRPCsHTTPstreaming
Chat CompletionsChatCompletionPOST /v1/chat/completionsunary
StreamChatCompletionPOST /v1/chat/completions/streamserver-stream
EmbeddingsEmbedPOST /v1/embeddingsunary
RerankRerankPOST /v1/rerankunary
StreamRerankPOST /v1/rerank/streamserver-stream
TranscriptionTranscribePOST /v1/audio/transcriptionsunary
Infer (generic)InferPOST /v1/inferunary
StreamInferPOST /v1/infer/streamserver-stream
Models (list & get)ListModelsGET /v1/modelsunary (no auth)
GetModelGET /v1/models/{model}unary (no auth)

Polymorphic fields

OpenAI-polymorphic request/response fields — content, tool_choice, response_format, embedding input — are carried as google.protobuf.Value, so you can paste the exact OpenAI/Cohere JSON object verbatim. See Conventions for wire and streaming notes.


See also