Does model-agnostic mean lowest-common-denominator quality?

No. Routing per task means you can send each step to the model that does it best — a strong reasoning model for a hard decision, a fast or cheaper model for a high-volume classification — rather than forcing one model to do everything. The grounding, approvals, and audit trail are the same regardless of which model answered.

What happens when a provider has an outage?

Work fails over to an alternate provider rather than stalling. Because the platform owns intake, grounding, policy, and the audit trail, swapping the underlying model is an implementation detail — the WorkItem, its evidence, and its approval gates are unchanged.

Doesn't using a vendor's model mean you are tied to that vendor anyway?

Threada uses providers like OpenAI and Google's Gemini for inference and embeddings, but the platform — not the vendor — owns the work. You can change provider per task or in response to price, latency, or availability without changing how work is governed, which is the opposite of building on a single vendor's assistant.

Don't Marry One Model: Why a Work Platform Routes Across Providers

A lot of teams are quietly betting their operations on one model vendor. They build on a single assistant, wire their workflows to one provider’s quirks, and inherit that vendor’s outages, price changes, deprecations, and rate limits. That is not a capability. It is a dependency you did not negotiate.

Threada is built the other way around: the platform owns the work, and the model is a swappable component underneath it. Being model-agnostic — routing across providers with fail-over — is a deliberate architecture decision, and it buys an operations team three things that matter.

Reliability: an outage shouldn’t stop the work

When your automation is one API away from a single provider, that provider’s bad afternoon is your bad afternoon. Routing across providers means a degraded or down model fails over to an alternate rather than stalling the queue.

This is only possible because the model is not where the important state lives. In Threada, intake normalizes into a WorkItem, answers are grounded in your cited sources, sensitive outcomes pass approval gates, and every step lands on an audit trail. The model produces a draft answer or a proposed action inside that structure. Swap the model and the WorkItem, its evidence, and its approvals are unchanged — so fail-over is a routing decision, not a re-architecture.

Fit: send each task to the model that does it best

“One model for everything” is a constraint, not a simplification. Real operational work is a mix: a high-volume intake classification, a careful extraction, a hard grounded answer, a policy-sensitive action proposal. Those have different cost, latency, and accuracy profiles.

Routing per task lets you match each step to the right model — a strong reasoning model where judgment matters, a faster or cheaper model where throughput matters — instead of overpaying a frontier model for a trivial classification or asking a small model to do something it can’t. The governance around the step does not change with the model: grounding, citations, approvals, and the receipt are the platform’s job, not the model’s.

Longevity: the frontier moves, your operation shouldn’t have to

The model you would pick today is not the one you will pick in a year. New models ship, prices fall, context windows grow, and providers deprecate old endpoints. If your operation is welded to one model, every one of those events is a migration project.

If the platform owns the work and treats the model as a component, adopting a better model is a configuration change you can put through an evaluation gate — run the new model against your labeled dataset, confirm it does not regress extraction, routing, grounding, or action behavior, and promote it behind canary traffic. The frontier moving becomes an upgrade you opt into on your terms, not a fire drill.

What “model-agnostic” does and does not mean

It does not mean avoiding the best models — Threada uses providers like OpenAI and Google’s Gemini for inference and embeddings, and routing means you can use the strongest model for a given task. It does not mean lowest-common-denominator quality. And it does not mean you never depend on a provider for a single call.

What it means is that no single vendor owns your operation. The intake, the structured WorkItem, the grounded evidence, the approvals, and the audit trail belong to the platform and stay constant. The model is the part you can change — per task, in response to an outage, or to adopt something better — without changing how the work is governed.

That is the difference between buying a vendor’s assistant and running a work platform: with the assistant, the vendor’s model is the product, so their roadmap is your roadmap. With a platform, the model is an input, and you keep the right to choose it.