Can you find performance benchmarks for models, especially OpenAI 5.4?
AI Models & Capabilities
The provided sources do not contain specific performance benchmarks for an OpenAI model version 5.4. However, they reference benchmarks for related OpenAI models in comparative contexts. For instance, Z.ai's GLM-5, a 744-754 billion-parameter open-source model, scores competitively against GPT-5.2 across a range of benchmarks, outperforming models like Gemini 3 Pro and Grok 4 while achieving 80-90% of frontier-model performance overall [1][3][5]. Similarly, MiniMax's M2.5 matches GPT-5.2 and Claude Opus 4.6 on coding benchmarks, though its system card notes gaps in code generation and broad knowledge relative to US closed-source models such as OpenAI's [4][8].
One direct performance figure for an OpenAI variant is GPT-5.3-Codex-Spark, which achieves an inference speed of 1,000 tokens per second on Cerebras CS3 accelerators [2]. Broader discussions highlight challenges in AI benchmarking, such as the overemphasis on coding tasks and the need for end-to-end evaluations like full data science projects, but the sources provide no GDPval or other quantitative scores for OpenAI 5.x models [6][10][11].
Sources
- [1] Chinese AI Model Challenges US Margins — GAI Insights Newsletter
- [2] OpenAI dishes out its first model on a plate of Cerebras silicon — The Register
- [3] Z.ai's GLM-5 Challenges Western AI with Competitive Pricing — The Rundown AI
- [4] MiniMax's M2.5 Offers Low-Cost Frontier AI Coding — The Rundown AI
- [5] Z.ai Launches GLM-5 with Competitive Pricing — The Rundown AI
- [6] Has anyone actually benchmarked AI ability with any of the default knowledge work skills shipping with Claude Cowork? Does it increase GDPval scores over default 4.6? (Not GDPval-AA) It seems worth testing for real, given that the market freaks out every time they ship skills. — @emollick
- [7] I have to praise both @METR_Evals & @EpochAIResearch for doing a great job on benchmarking AI ability and also being transparent about how challenging this kind of benchmarking is, & how, exactly, they do it (and also making data available). Very rare in the AI benchmarking world — @emollick
- [8] Impressive benchmarks for the new Chinese LLM. The system card notes some gaps with US closed source models in code generation & wide knowledge, so be interested to see it in operation. — @emollick
- [9] LFM2-24B-A2B Model Released — Daily AI News February 25, 2026
- [10] Benchmarking AI Performance on End-to-End Data Science Projects — arXiv
- [11] What a great illustration of the central problem of AI benchmarking for real work All of the effort is going into benchmarking for coding, but that is a small part of the actual jobs people do, which leaves the true trajectory of AI progress less clear. — @emollick
- [12] I think it is entirely possible that there will be no new frontier open weights models at some point in the near future. Counting on the Chinese AI labs to keep making their models free forever doesn’t make sense as model costs rise & the value of having a frontier model goes up — @emollick
- [13] New OpenAI GPT-5.4 AI Model: Everything You Need to Know — Geeky Gadgets
- [14] GPT-5.4 Review: OpenAI's Best Model Yet (Full Breakdown) — The Neuron
- [15] GPT-5.4 (xhigh) - Intelligence, Performance & Price Analysis — Artificial Analysis
- [16] OpenAI Launches GPT-5.4 With Expanded Context Window, Improved Reasoning, and Higher Benchmark Performance — AI Insider
- [17] OpenAI Reclaims Benchmark Lead with GPT-5.4 Release — Techstrong.ai
- [18] The New Best Model Is Here (GPT-5.4) — YouTube
- [19] GPT-5.4: What I Verified With Code — Lakshmi Narayana U, Medium
- [20] Introducing GPT-5.4 — OpenAI
- [21] OpenAI launches GPT-5.4 Thinking and Pro combining coding, reasoning, and computer use in one model — The Decoder
- [22] OpenAI launches GPT-5.4: reasoning, coding, and computer use in one — Techzine Global
- [23] OpenAI’s GPT-5.4 sets new records on professional benchmarks — The Next Web
- [24] OpenAI launches GPT-5.4 with Pro and Thinking versions — TechCrunch
- [25] OpenAI launches GPT-5.4 with native computer use mode, financial plugins for Microsoft Excel, Google Sheets — VentureBeat
- [26] OpenAI's new GPT-5.4 clobbers humans on pro-level work in tests - by 83% — ZDNET
- [27] OpenAI, in Desperate Need of a Win, Launches GPT-5.4 — Gizmodo
- [28] GPT-5.4 is here — and OpenAI just made every other AI model look slow — Tom's Guide
- [29] GPT-5.4 Targets Anthropic’s Claude With Premium Pricing and Coding Muscle — Trending Topics
Related questions
- What is retrieval-augmented generation (RAG), and why is it important for enterprise AI deployment?
- How should non-technical executives evaluate and compare AI model performance benchmarks?
- What is multimodal AI, and why does it matter for practical business applications?
- How quickly are AI capabilities improving, and is there credible evidence that the pace of progress is slowing?