gpt-3.5-turbo — Fast, cost-effectivegpt-4 — Most capablegpt-4-turbo — Latest GPT-4 varianttext-embedding-ada-002 — Embeddingsmax_tokens — Response length limittemperature — Creativity (0–2)top_p — Nucleus samplingstream — Real-time responses