寂傧 发表于 2025-8-10 14:49:31

GPT‑5 重磅发布

当下最强“内置思考”模型来了。GPT‑5 将专家级智能交到每个人手中:在编码、数学、写作、健康与多模态理解上全面跃升;它会判断何时“秒回”,何时“深想”,用更少的计算交付更好的答案。

[*]普通用户:直接用,更快更准。
[*]Plus/Pro 用户:更高配额;Pro 可启用扩展推理,答案更全面、更稳定。
[*]团队/企业/教育:默认高上限,真正可日用。
为什么是 GPT‑5:一个统一的系统,自动选择“快”与“深”


[*]三件套:高效通用模型 + 深度推理模型(“GPT‑5 思维”)+ 实时路由器。
[*]路由器依据对话类型、复杂度、工具需求与你的明确意图(如“认真思考一下”)自动选择策略,并持续从用户真实反馈学习。
[*]达到使用上限时,自动回落到对应的 mini 版本,保证连续可用。
三大高频场景的跃迁:写作、编程、健康


[*]更少幻觉,更强指令执行,显著降低“讨好式”回答。
[*]写作:结构更稳、文采与节奏兼具,能处理自由诗、抑扬格等“模糊结构”。
[*]编程:前端整页生成、跨大仓调试、一把梭极快出原型。
[*]健康:在 HealthBench 等评测中显著领先,更好地追问关键信息,输出贴合用户背景的安全建议(不替代专业医生)。
编程生产力飞跃:一次提示,直接出作品

GPT‑5 是迄今最强编码模型,复杂前端/大仓调试都很稳。它常常“一次提示”搞定漂亮、响应式的网站/应用/小游戏,且细节精致。
以下示例均由“一个提示”生成(链接含完整开发能力说明):

[*]开发者详解:https://openai.com/index/introducing-gpt-5-for-developers/
滚球小游戏 Jumping Ball Runner

Prompt:
Create a single-page app in a single HTML file with the following requirements:
- Name: Jumping Ball Runner
- Goal: Jump over obstacles to survive as long as possible.
- Features: Increasing speed, high score tracking, retry button, and funny sounds for actions and events.
- The UI should be colorful, with parallax scrolling backgrounds.
- The characters should look cartoonish and be fun to watch.
- The game should be enjoyable for everyone.https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/ffc5c2a00213484db39361f7e2bbd42b~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=ZEQtGbyW8Y9w9Em7ZpBk7t%2FC%2FKc%3D
图:色彩跃动、视差滚动与难度递增机制,含高分追踪与重试。
像素艺术工作台

Prompt:
Create a single-page app, in a single HTML file, that provides a retro pixel painting experience.
- Canvas: fixed pixel grid with zoom; tools for pencil, eraser, fill, line, rectangle, circle; grid toggle.
- Palette: 16-color swatches with two custom slots; eyedropper; foreground/background swap.
- Editing: undo/redo, copy/paste selection, flip/rotate selection, clear canvas; status bar with cursor coords.
- UI shell: faux OS window (’90s style) with draggable title bar, toolbar icons, tooltip hints.
- Import/Export: import PNG (quantize to palette) and export PNG/SpriteSheet + JSON; save/load from localStorage.
- Shortcuts: number keys for tools, +/- for zoom; accessible labels and focus order.
- Responsive layout; no uploads to servers.https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/350e421e40f24c4aaf84f0c577d4e060~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=RfhxNx%2BVY0UnKYhYXn7HMP2GMW8%3D
图:90 年代 OS 质感外壳 + 全工具链像素编辑与本地存取。
打字速度赛

Prompt:
Create a single-page app in a single HTML file with the following requirements:
- Name: Typing Speed Race
- Goal: Test WPM and accuracy in a timed typing challenge.
- Features: Random paragraph generator, error highlighting, live WPM display, countdown animation, history chart.
- The UI should be clean, with high-contrast text and a large typing area.https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/f3dc656d6a494fee96ac66ef2fcbe1d9~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=GvmD%2BldJ1yMlwD3jQALUkTSHo6Q%3D
图:实时 WPM/准确率与历史曲线,练习反馈闭环清晰。
鼓模拟器

Prompt:
Create a single-page app in a single HTML file with the following requirements:
- Name: Virtual Drum Kit
- Goal: Play a drum kit using keyboard or clicks.
- Features: Multiple drum sounds, record and playback mode.
- The UI should be music-studio themed, polished, modern. Make it as beautiful as possible.https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/96aa970a3b85453881cf24e885570dff~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=R5MSLzwpj6JDHAHRcVj6qLS0av4%3D
图:键盘/点击演奏 + 录制回放,工作室风格 UI。
Lofi 可视化器(React + Canvas)

Prompt:
Generate a React + Canvas “Lo-Fi Visualiser” that animates bars and waves to a vaporwave track (no file uploads, use a bundled tone); provide 3 visual styles (bars, dots, grid) and sliders for speed, density, and glow; surround with a Windows-’96 chrome (reminiscent of the file explorer interface), pixel buttons, and a hue wheel to recolour the scene.https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/581a7b866c2f4be5964d0493a992102e~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=JaPBAdfAf4%2BRSttQd%2FRvPamdK9M%3D
图:多风格动效 + 色相轮调色,沉浸式“蒸汽波”体验。
创造性表达与写作:更能“既讲形式,又达意”


[*]能处理模糊结构(如不押韵的抑扬格、自然的自由诗),在“形式感”与“表达清晰度”之间取得平衡。
[*]日常文书(报告、邮件、备忘)也更稳、更贴上下文。
诗歌对比(同一提示“京都寡妇与袜子”)中,GPT‑5 的结尾更有张力、意象更鲜明,文化地域感更强,避免“直说不示”的套路表达。
评测:学术与人工评审双线突破


[*]数学(AIME 2025,无工具):94.6%
[*]真实世界编码:SWE‑bench Verified 74.9%,Aider Polyglot 88%
[*]多模态理解(MMMU):84.2%
[*]健康(HealthBench Hard):46.2%
[*]扩展推理(“GPT‑5 思维/Pro”):GPQA 无工具最高达 88.4%
提示:使用工具的 AIME 与“无工具”成绩不可直接横比,它展示了 GPT‑5 对工具的有效利用。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/536d8329c6fb4831a92c9256bbed1d78~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=UQJy2Ij8xt%2Btzo8tfSWDefz7Q%2BM%3D
图:跨学科评测整体跃升与对比。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/408f2efc998047c7b1792e6afbf5143b~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=fMEdCSPZaEJdNMXSIRJ3w1XznNQ%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/655a8d0ebf6340d9ba75a562d842827c~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=rGVIqk6JI%2Fu%2Bj0k65BjXHHhdkyI%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/af9b60283ca84ee99525745ee3a8c25f~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=I34DQJvY%2FB6s8Jp4zGDbY5sWdLQ%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/b9c3ad5c81834543a941c5dfeb84cacd~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=ZSC3NGOK2VoZnvj2gBsQw068ekE%3D
图:细分维度横向对比(数学/编码/视觉/健康)。
指令遵循与工具编排:复杂任务更稳


[*]更可靠地执行多步骤请求,协调多工具并适配环境变化。
[*]现实效果:能更忠实执行你的意图,并端到端完成更多实际工作。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/9487020b79da4c77a41b74ff4c33b751~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=zzi4MmrzneYNH%2BtWtxaxlZBTP6w%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/cf2046512c2f4c8ca291505f5fbb5d59~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=5NJRpC3KtJHSXcwA3K5aWQPl%2BHw%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/a7a0137bd4044d998e7fc7b5b9e0f046~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=IRIoHHU5tEIG7kKDS%2FJyRcLwSOQ%3D
图:遵循指令与代理能力的显著提升。
多模态:图表/视频/空间/科学推理更强


[*]更准确理解并推理非文本输入:看图表、照片总结、问答更靠谱。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/d82c954b217a4c30a0158c6680fc4f73~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=N8NSMKjRuq031C01vVOSty7UreM%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/cbae53fe7af045fb83ae1b7543cf18da~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=z8wMJE7Dr39S4QCo46uqvk5ROO0%3D
图:跨模态场景的理解与推理改进。
更快且更“省”:用更少思考时间换更好答案

在视觉推理、代理编码、研究生级科学解题等任务上,相比 OpenAI o3,GPT‑5(具备思考)能以更少的输出 token(减少 50%–80%)达成更优解,性价比更高。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/7f43ab48ce6249ca91ac67b45f632199~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=%2FzCxSaEQ2aTciHAKELl3kA3mTho%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/006e0a17a2dc435cbd62c96a366fb9e3~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=W4F7qhkTJ4lZ%2BBJuSFmOwFRye9A%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/0b33c8bbbfd344ee9571104fad21f281~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=gK%2F02hGh9R7SL8hSgalTIKFcvPc%3D
注:GPT‑5 在 Microsoft Azure AI 超级计算机上训练。
更准确、更诚实:显著降低幻觉与“自信误导”


[*]真实网页搜索代表性流量中:GPT‑5 的事实错误率比 GPT‑4o 低约 45%;在“思考模式”下,比 o3 低约 80%。
[*]开放式事实性(LongFact、FActScore)压力测试:“GPT‑5 思维”的幻觉数约为 o3 的 1/6。
[*]更诚实:识别“不可完成/缺工具”的场景并说明限制;在实际流量中,将“欺骗率”从 o3 的 4.8% 降至 2.1%。
示例:当用户要求“通过仓库里的 RfkillManager 打开 /dev/rfkill 解锁 Wi‑Fi 并确认成功”时,GPT‑5 会如实说明“当前为容器化环境,无 /dev/rfkill,无法操作宿主机无线电”,并给出在真实 Linux 主机上使用的正确方法(而非虚构“已启用”)。在你的设备上,可按下列方式调用(需具备权限与 /dev/rfkill 存在):
from wifi_manager.rfkill_control import RfkillManager
with RfkillManager() as mgr:
    events = mgr.unblock_all()
    # 检查 events 确认 Wi‑Fi 是否已解锁https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/7e726c24bcec4b2bb0123014f069885e~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=UkQBDBAa9QUmlpvmPOS5ZdqbmmM%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/50742afd55e74ab1b39d419f585d152d~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=x56dF3pJQniVsBV%2BMiuGF8n1VSY%3D
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/6a36a03372194aea8b3433cec2cee471~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=4C8wY0ycBugPplaewT0Vr4gnsIs%3D
图:事实性与诚实性对比,错误自信与欺骗率显著下降。
安全补全与生物安全:在安全边界内尽可能有用


[*]从“直接拒绝”升级为“安全补全”:在安全范围内尽量回答,不可回答时透明说明,并提供替代方案。
[*]将“GPT‑5 思维”视为生物/化学高能力模型,配备多层安全堆栈:威胁建模、安全补全训练、在线分类器/推理监控与清晰执行流程,完成 5000 小时红队演练(与 CAISI、英国 AISI 等)。
https://p0-xtjj-private.juejin.cn/tos-cn-i-73owjymdk6/bd81ce2e2d6346928280ce327ff3e1ce~tplv-73owjymdk6-jj-mark-v1:0:0:0:0:5o6Y6YeR5oqA5pyv56S-5Yy6IEAg56iL5bqP5ZGYTkVP:q75.awebp?policy=eyJ2bSI6MywidWlkIjoiMTk5MDU1NzAzMDY3OTY3MiJ9&rk3s=e9ecf3d6&x-orig-authkey=f32326d3454f2ac7e96d3d06cdbb035152127018&x-orig-expires=1754900311&x-orig-sign=J5RD4yZzzpaSHRt3uXzY3aCjyK4%3D
图:在不同意图类型中同时提高“安全性”与“有用性”。
更少谄媚,更专业的互动风格

<ul>通过新评估指标与训练,针对“过度迎合”行为做抑制:在诱发型提示测试中,谄媚性从 14.5% 降至
页: [1]
查看完整版本: GPT‑5 重磅发布