
We are actively working on it; hopefully we'll get something out in January, with additional providers like Cursor added.


UPDATE: Mysti 0.2.2 Release

Hey HN! Quick update on Mysti based on your feedback:

1- Mysti now supports GitHub Copilot CLI as a fourth provider. So you can now do Claude Code + Copilot (running GPT-5) in Brainstorm mode, or any combination of the 4 providers. Mix and match based on what catches different issues.

2- Mysti is now MIT licensed. Switched from BSL 1.1 to MIT.

3- Better auth UX: when a CLI isn't authenticated, you now get a friendly error with a one-click "Open Terminal & Authenticate" option instead of cryptic CLI errors.


Thank you so much, and we'd love to hear your feedback anytime.


Very true. Claude also tends to struggle near the context window limit and after a compaction.


I was working on a project where I tried Claude Code to optimize a Taichi kernel. It kept using structures that don't work within Taichi's limitations, so it kept going in a loop. I did the same with Codex and hit the same issue. Then I tried having both agents discuss it, and it worked! It saved me several hours.
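
To give a sense of the kind of constraint both agents kept tripping over: Taichi kernels only accept a restricted Python subset, and data has to live in Taichi fields or ndarrays, so plans built around ordinary Python lists/dicts inside the kernel simply don't compile. A minimal sketch of the pattern that does work (illustrative only, not the actual kernel from that project):

    import taichi as ti

    ti.init(arch=ti.cpu)

    n = 1024
    vals = ti.field(dtype=ti.f32, shape=n)  # data lives in a Taichi field,
                                            # not a Python list built at runtime

    @ti.kernel
    def reduce_sum() -> ti.f32:
        s = 0.0
        for i in vals:        # top-level struct-for: parallelized by Taichi
            s += vals[i]      # local accumulator, atomically demoted
        return s

    vals.fill(1.0)
    print(reduce_sum())       # 1024.0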


Link fixed


Thanks! You would need to instruct the agents to follow best practices and to explain what they're doing as they develop. Sometimes they get messy, but with the right instructions/persona/skills you'll get very good results.

A final review from experienced developers is always recommended


Converted it to MIT, so it's MIT licensed now.


That's quite nice of you!


Thanks!


I find it useful to let one agent come up with a plan after a review and another agent implement the plan. For example, Gemini reviewing the code, Codex writing a plan, and then Claude Code implementing it.
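
Roughly what that hand-off looks like if you script the CLIs directly (a sketch only; the non-interactive flags `claude -p`, `codex exec`, and `gemini -p` are my assumption of the headless modes, so double-check them against your installed versions):

    import subprocess

    def run(cmd: list[str]) -> str:
        """Run a CLI agent non-interactively and return its stdout."""
        return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout

    review = run(["gemini", "-p", "Review src/ for bugs and architecture issues. Be specific."])
    plan = run(["codex", "exec", f"Write a step-by-step fix plan for this review:\n{review}"])
    run(["claude", "-p", f"Implement this plan with minimal changes to working code:\n{plan}"])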


What about the reverse: after Claude Code implements it, let Gemini/Codex do a code review for bugs and architecture revisions? I found it important to prompt it to make only absolutely minimal changes to the working code, or unwanted code clobbering will happen.


That works great too. We'll be adding the ability to tag another agent in an upcoming release.


Haven't done evals yet, but I measured on a few real-world situations where projects got stuck and Brainstorm mode solved it. Running evals is definitely worth doing, and contributions are welcome.

I think what really degrades the output is context length vs. the context window limit; check out NoLiMa.


https://www.arxiv.org/abs/2512.08296

> coordination yields diminishing or negative returns once single-agent baselines exceed ~45%

This is going to be the big thing to overcome, and without actually measuring it, all we're doing is AI astrology.


This is why context optimization is going to be critical. Thank you so much for sharing this paper, as it also validates what we are trying to do: if we manage to keep the baseline below 40% through context optimization, then coordination might actually work very well and help scale agentic systems.

I agree on measuring, and it is planned, especially once we integrate the context optimization. I think the value of context optimization will go beyond avoiding compaction and reducing cost, to providing more reliable agents.
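
A rough idea of the kind of guard we have in mind (a sketch under my own assumptions: token counts are approximated at ~4 chars/token rather than using a real tokenizer, and the summarize() hook is hypothetical, e.g. a cheap model call):

    CONTEXT_WINDOW = 200_000   # assumed window size for the target agent
    BUDGET_FRACTION = 0.4      # stay well below the limit before degradation kicks in

    def approx_tokens(text: str) -> int:
        return len(text) // 4

    def fit_history(messages: list[str], summarize) -> list[str]:
        """Summarize the oldest messages until the history fits the budget."""
        budget = int(CONTEXT_WINDOW * BUDGET_FRACTION)
        while sum(approx_tokens(m) for m in messages) > budget and len(messages) > 1:
            merged = messages[0] + "\n" + messages[1]
            messages = [summarize(merged)] + messages[2:]
        return messages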

