Tinfoil
TEE-Terminated TLSVerifiable private inference with Intel TDX and NVIDIA H100 CC
Tinfoil provides confidential inference with hardware-attestable privacy guarantees. Every request runs inside an Intel TDX enclave or on an NVIDIA H100 Confidential Computing GPU. Users can cryptographically verify that their prompts are processed in a trusted execution environment and never exposed to the host operator.
API Features
OpenAI Compatible ✓
Streaming ✓
Function Calling ✓
Batch Inference —
Vision ✓
Embeddings ✓
Attestation Status
View full details →Last checked: Jun 18, 2026, 10:01 AM
Fully verified Every check passed its required attestation, channel, and provenance gates.
9 verified 0 partial 0 failed 0 unavailable
Hardware 9/9 good Freshness Not shown Key binding 8/9 good Workload Not shown Provenance 9/9 good Availability 9/9 good
DeepSeek V4 Pro Intel TDX TEE-Terminated TLS Details Gemma 4 31B Intel TDX TEE-Terminated TLS Details GLM 5.1 Intel TDX TEE-Terminated TLS Details GPT-OSS 120B Intel TDX TEE-Terminated TLS Details GPT-OSS Safeguard 120B AMD SEV-SNP TEE-Terminated TLS Details Kimi K2.6 Intel TDX TEE-Terminated TLS Details Llama 3.3 70B AMD SEV-SNP TEE-Terminated TLS Details Qwen3-VL 30B Intel TDX TEE-Terminated TLS Details Router AMD SEV-SNP TEE-only Gateway Details
Models & Pricing
| Model | Family | Context | Max Out | Input /M | Output /M | Free |
|---|---|---|---|---|---|---|
| Kimi K2.6 | Moonshot AI | 256K | — | $1.50 | $5.25 | — |
| GLM 5.1 | ZhipuAI GLM | 200K | — | $1.50 | $5.25 | — |
| DeepSeek V4 Pro | DeepSeek | 800K | — | $1.50 | $5.25 | — |
| Gemma 4 31B | Google Gemma | 256K | — | $0.45 | $1.00 | — |
| Qwen3-VL 30B A3B | Qwen | 256K | — | $1.25 | $4.00 | — |
| Llama 3.3 70B | Meta Llama | 128K | — | $1.75 | $2.75 | — |
| GPT-OSS 120B | OpenAI GPT | 128K | — | $0.75 | $1.25 | — |
| GPT-OSS Safeguard 120B | OpenAI GPT | 128K | — | $0.50 | $1.00 | — |
| Nomic Embed Text | Nomic | 8K | — | $0.05 | $0.00 | — |
| Voxtral Small 24B | Mistral | 32K | — | $0.20 | $0.60 | — |
| Whisper Large V3 Turbo | OpenAI GPT | 29K | — | $0.01/request | — | — |
| Qwen3-TTS 1.7B | Qwen | 4K | — | $0.01/request | — | — |