Security & data handling

Your data stays
where it belongs.

Most AI agent vendors will quietly fine-tune their model on your customer conversations, route your API calls through their proxy, and treat your data as fuel. Forge does none of that. Here's the full data-handling matrix in plain English — and the contract clauses we'll sign to back it.

Data handling matrix

What touches what — by category.

For every category of data your build will use, here is who sees it, where it lives, and how long it persists. If a row breaks your policy, tell us at discovery — we'll engineer around it or refuse the engagement.

Data categoryForge sees it?Stored where?Used to train models?Retention
Source codeYes (read-write to your repo)Your GitHubNeverYours forever
API keys / secretsScoped credentials in build env onlyYour secrets manager (AWS SM / Vault / 1Password)NeverRotated post-launch
Production customer PIINo — synthetic + masked test data onlyYour infra exclusivelyNeverN/A — never leaves your stack
Discovery transcriptsYes (recorded screen-share)Encrypted on Forge laptop, mirrored to your shared driveNeverDeleted day 90
Test datasetsYesYour sandbox envNeverYours forever
LLM prompts (build phase)Yes (engineered)Your repoNo (we use API tier with no-train flags)Yours forever
LLM completions (runtime)No (ships in your code, not ours)Your infraNoPer your retention policy
Slack channel contentYes (private channel)Your Slack workspaceNeverPer your Slack retention
Monitoring logsDuring support window onlyYour observability stackNeverPer your log policy
Invoices / contractsYesForge accounting (Stripe + bookkeeping)Never7 years (CRA tax requirement)
Five contractual commitments

What we'll sign in the SOW.

Every Forge Statement of Work includes these five clauses. They're not negotiable in the way that they get watered down — they're negotiable in the way that you can ask us to make them stricter.

1

No training on your data

Forge will never use your source code, customer data, prompts, completions, or transcripts to train, fine-tune, or evaluate any model — internal or external. LLM API calls during build use the OpenAI/Anthropic enterprise tiers with explicit no-training flags.

2

Scoped credentials only

We work against API keys with the minimum scope required (read-only where possible, write only on the surface we're shipping to). Credentials live in your secrets manager. We don't copy them to a Forge laptop.

3

Synthetic test data by default

Discovery starts with masked or synthetic data. Real production data only enters the build environment if your security team explicitly approves a path. Most builds never need it.

4

Day-90 transcript purge

Discovery recordings, internal Forge notes, and Slack channel exports are purged 90 days after final delivery. Notarised purge log on request.

5

Incident notification within 24h

If Forge becomes aware of any potential exposure of your data — accidental commit of a secret, lost laptop, sub-vendor breach — you're notified within 24 hours with an incident report. No legal-team gymnastics, no 'pending investigation' stalling.

6

Right-to-audit clause

You have the right to audit Forge's handling of your data with 30 days' notice. Most enterprise vendors fight this clause. We don't. Bring your own auditor or use ours; you pay theirs, we pay ours.

What we won't do

The four data-handling lines we won't cross.

Some asks come up often enough that it's simpler to publish the answer here.

Common ask

Can you ship the code through a Forge-hosted relay so we don't have to manage credentials?

Forge answer

No. The relay becomes a single point of failure with your data passing through it. Your credentials, your infrastructure. We'll help you set up the secrets manager if you don't have one.

Common ask

We'd like Forge to keep the agent code on your servers and license it back to us.

Forge answer

No. Code transfers to your GitHub on day 28. The whole point of Forge is that you own the artefact. If you want a hosted SaaS, hire a SaaS vendor instead.

Common ask

Can you use ChatGPT (consumer) for prompt iteration?

Forge answer

No. We use only enterprise/API tiers with no-training flags on every model provider. Consumer ChatGPT data flows are explicitly disallowed in the SOW.

Common ask

Can the agent log full customer conversations to a Forge dashboard for monitoring?

Forge answer

No. Monitoring goes to your observability stack (Datadog / New Relic / OpenTelemetry). Forge sees logs only during the contracted post-launch support window, scoped to debugging — never aggregate customer data.

Sub-processors

Every vendor in the build path — listed.

For your DPIA / GRC team. Updated when the stack changes.

VendorPurposeData classRegion
OpenAI (API tier)LLM inference during build (no-train mode)Engineered prompts onlyUS
Anthropic (API tier)LLM inference during build (no-train mode)Engineered prompts onlyUS
GitHubSource code hosting (your account)Source code, your repoUS
LinearInternal task trackingEngagement metadata, no customer dataUS
1Password BusinessForge internal secrets (not yours)Forge employee creds onlyCA
StripeInvoicing & paymentBilling contact + line itemsUS
CloudflareForge marketing site (this site)Public marketing onlyGlobal

Need a signed DPA, NDA, or vendor risk questionnaire completed? Reply to your discovery email — most are turned around in 48 hours.

Discovery includes a security-team intro.

We default to a 30-minute call with your security or compliance lead during week 1 — to walk through the matrix, sub-processors, and any custom clauses your DPA needs. Free, scheduled with the discovery, lands you a signed SOW that doesn't get blocked at procurement.

Request a security walk-through →Back to Forge