GPT-4 API 全面可用及 Completions API 中旧模型的淘汰

2024年12月8日修改

OpenAI 发布了一系列重要的更新。首先，自今日起，所有付费的 API 客户都可以访问具有 8K 上下文的 GPT-4。自三月份以来，数百万的开发者请求访问 GPT-4 API，利用 GPT-4 的创新产品范围也在日益扩大。OpenAI 计划在本月底向新开发者开放访问权限，并根据计算可用性逐步提高速率限制。

同时，基于这些模型在生产规模使用中的稳定性和就绪性，GPT-3.5 Turbo、DALL·E 和 Whisper APIs 也已普遍可用。OpenAI 正在努力安全地为 GPT-4 和 GPT-3.5 Turbo 实现微调功能，预计该功能将在今年晚些时候推出。

OpenAI 还引入了 Chat Completions API，并表示它现在占其 API GPT 使用量的 97%。这种基于聊天的范式被证明是强大的，能够处理绝大多数以前的用例和新的会话需求，同时提供更高的灵活性和特异性。OpenAI 计划将其平台的大部分努力继续投入到这个方向，以提供一个对开发者来说越来越强大和易于使用的体验。

然而，作为对 Chat Completions API 增加投资和优化计算能力的一部分，OpenAI 将在 6 个月后淘汰 Completions API 中的一些旧模型。从今日起，该 API 将在开发者文档中被标记为“遗留”。2024 年 1 月 4 日起，旧的完成模型将不再可用，并将被以下模型所取代：

|旧模型|新模型|

|---|---|

|ada|babbage-002|

|babbage|babbage-002|

|curie|davinci-002|

|davinci|davinci-002|

|davinci-instruct-beta|gpt-3.5-turbo-instruct|

|curie-instruct-beta|gpt-3.5-turbo-instruct|

|text-ada-001|gpt-3.5-turbo-instruct|

|text-babbage-001|gpt-3.5-turbo-instruct|

|text-curie-001|gpt-3.5-turbo-instruct|

|text-davinci-001|gpt-3.5-turbo-instruct|

|text-davinci-002|gpt-3.5-turbo-instruct|

|text-davinci-003|gpt-3.5-turbo-instruct|

使用稳定模型名称的应用程序（如<beginning of the code>ada<end of the code>，<beginning of the code>babbage<end of the code>，<beginning of the code>curie<end of the code>，<beginning of the code>davinci<end of the code>）将在 2024 年 1 月 4 日自动升级到上述新模型。新模型也将在未来几周内可供早期测试，通过在 API 调用中指定以下模型名称：<beginning of the code>babbage-002<end of the code>，<beginning of the code>davinci-002<end of the code>。使用其他旧完成模型（如<beginning of the code>text-davinci-003<end of the code>）的开发者需要在 2024 年 1 月 4 日之前通过在其 API 请求的“model”参数中指定<beginning of the code>gpt-3.5-turbo-instruct<end of the code>来手动升级其集成。<beginning of the code>gpt-3.5-turbo-instruct<end of the code>是一种 InstructGPT 风格的模型，与<beginning of the code>text-davinci-003<end of the code>的训练方式类似。这个新模型是 Completions API 中的直接替代品，并将在未来几周内可供早期测试。

对于希望在 2024 年 1 月 4 日之后继续使用其微调模型的开发者，他们需要在新的基础 GPT-3 模型（<beginning of the code>babbage-002<end of the code>，<beginning of the code>davinci-002<end of the code>）或更新的模型（<beginning of the code>gpt-3.5-turbo<end of the code>，<beginning of the code>gpt-4<end of the code>）上进行微调替换。一旦今年晚些时候该功能可用，OpenAI 将优先为之前微调过旧模型的用户提供 GPT-3.5 Turbo 和 GPT-4 微调的访问权限。OpenAI 意识到从基于自己数据微调的模型迁移是具有挑战性的，因此将为之前微调过模型的用户提供支持，以使过渡尽可能顺利。

此外，使用旧嵌入模型（如<beginning of the code>text-search-davinci-doc-001<end of the code>）的用户需要在 2024 年 1 月 4 日之前迁移到<beginning of the code>text-embedding-ada-002<end of the code>。OpenAI 于 2022 年 12 月发布了<beginning of the code>text-embedding-ada-002<end of the code>，发现它比以前的模型更强大且更具成本效益。如今，<beginning of the code>text-embedding-ada-002<end of the code>占所有嵌入 API 使用量的 99.9%。OpenAI 认识到这对使用这些旧模型的开发者来说是一个重大变化，并将承担用户使用这些新模型重新嵌入内容的财务成本，并在未来几天与受影响的用户联系。

最后，使用 Edits API 及其相关模型（如<beginning of the code>text-davinci-edit-001<end of the code>或<beginning of the code>code-davinci-edit-001<end of the code>）的用户需要在 2024 年 1 月 4 日之前迁移到 GPT-3.5 Turbo。Edits API 测试版是一个早期的探索性 API，旨在使开发者能够根据指令返回提示的编辑版本。OpenAI 在开发<beginning of the code>gpt-3.5-turbo<end of the code>和 Chat Completions API 时考虑了来自 Edits API 的反馈，现在可以用于相同的目的。

总的来说，OpenAI 的这些更新旨在为开发者提供更强大、更高效、更安全的工具和服务，同时优化其计算能力和资源配置，推动人工智能技术的发展和应用。

GPT-4 API 全面可用及 Completions API 中旧模型的淘汰​

GPT-4 API 全面可用及 Completions API 中旧模型的淘汰