AI safety
Before any of this, the team should run a design sprint to develop the character itself, covering what it is, how it behaves, who it serves, and how it sounds. That work has its own process.This document picks up from there.
It is specifically for thinking about safety, and it is the checklist I want us to work through before a consumer facing character goes live in front of real users. Some of this happens before the safety workshop, some of it is the workshop itself, and the rest runs after launch and keeps running. Work top to bottom.
Items marked with an asterisk are specific to Figurate. There is a short note on each one at the end of the doc.
1. Decide what the character is for
A character with a narrow, well defined job is much safer. Pin this down before the workshop so the team has something concrete to work against.
2. Run the workshop
3. Write the guard rails
4. Pick the technology and use the platform safely
5. Test the character
6. Launch
7. Run it after launch
8. Write it down
Figurate is the tool we use to build, test, and run AI characters. The items marked with an asterisk above are parts of it. Here is what they mean if you are reading this from outside the team.
The Director is the runtime control plane. It reads what the user is trying to do and decides how the conversation gets handled. This is where the real guard rails belong, because it sits below the character's prompt and a user cannot talk their way around it.
Legacy mode is the Off setting for the Director. It is the simpler fallback we can drop back to if something goes wrong.
The knowledge base is the reference material we write for a character to draw on. The character treats it as true, so it has to be accurate.
Memory is what the character learns about a user over time. It is held in five tiers, from stable identity facts down to passing context, and it is scoped per user so it does not leak between people.
The review path is the step that new memory writes pass through before they are kept, so the character does not quietly learn things unchecked.
Receipts and the ledger are the running record of important decisions and side effects. Every memory write and every routing decision can leave a receipt, and that is what lets us inspect the system after the fact instead of guessing from a transcript.
The voice lane is the live voice way of reaching a character. It runs alongside the text lane and shares the same character identity and context.The reason this matters for safety is simple. A lot of teams try to keep a character in line with prompt wording alone. In Figurate the guard rails, the audit trail, and the off ramp are real parts of the system, so the checklist above leans on them on purpose.