Prompt City
A Google-wide prompt library that I reframed from a sharing tool into a literacy and governance platform — full visual design system and UX architecture delivered in one month as sole designer.
Prompt City started as an internal prompt library — a place where Google engineers and Cloud teams could share and discover prompts for common tasks. When I joined, the core concept was sound but the product framing was wrong. A sharing tool optimizes for discoverability. A literacy platform optimizes for understanding. Those are different products.
I reframed Prompt City as a prompt literacy and governance platform: not just where you find prompts, but where you understand why they work, how to improve them, and how to reason about prompt quality before you deploy.
The Five-Component Anatomy
The foundation of the redesign was a semantic component system — a framework that decomposed any prompt into five distinct dimensions:
- Role Priming — establishing the model's persona, expertise level, and perspective
- Instruction — the explicit task or request
- Conversation Flow — how the model should handle multi-turn dialogue, clarifying questions, and follow-ups
- Context & Background — the situational information the model needs to respond appropriately
- Output Format — the structure, length, and format of the desired response
Every prompt in the system was annotated against these five dimensions. Users could see at a glance which components a prompt used, how heavily each was weighted, and what would happen to the output if any component was weakened or removed.
Interactive Attribute Tuners
The most original interaction design in the system: real-time Attribute Tuners that allowed users to adjust prompt components along semantic spectra and see the output change live.
Each tuner controlled a specific dimension of prompt behavior:
- Detail Level: Broad → Specific → Atomic
- Rigidity: Flexible → Structured → Strict
- Tone: Casual → Professional → Formal
- Verbosity: Concise → Standard → Comprehensive
As users adjusted a tuner, two things happened simultaneously: the prompt text rewrote itself to reflect the new setting, and a justification note explained what changed and why. Users weren't just getting a different prompt — they were learning why the change produced a different output.
This was the core literacy mechanism: the system taught prompt engineering principles through direct manipulation rather than documentation.
Prompt Strength Score
I designed a weighted composite metric — the Prompt Strength Score — that evaluated prompt quality across the five component dimensions:
- Role Priming: 30%
- Instructions: 30%
- Context Depth: 20%
- Output Formatting: 20%
The score was accompanied by auto-suggestions for improvement, an estimated turn count (how many back-and-forth exchanges the prompt would require before producing usable output), and auto-tagging that categorized the prompt by use case, model compatibility, and task type.
The Guided Builder
For users new to prompt engineering, I designed a Guided Builder — a structured creation flow that walked users through each of the five components in sequence, with examples and scaffolding at each step. Users could start from scratch or from a template, and the builder would flag missing components and suggest additions before the prompt was published.
Conversation Quality Index
The open question in prompt evaluation: how do you predict the quality of a conversation before it happens? I developed the Conversation Quality Index (CQI) — a predictive metric built on three dimensions:
- Structural completeness — are all five components present and appropriately weighted?
- Specificity gradient — does each component add unique constraint, or do components overlap?
- Ambiguity surface area — how many valid interpretations does the prompt admit?
CQI above 0.78 predicted first-pass acceptance above 85% in Prompt City pilot data. The metric became the foundation for auto-suggestions and the governance layer that flagged low-quality prompts before they were published to the shared library.
Delivery
Sole designer on a four-person part-time engineering team. Full visual design system and UX architecture delivered within one month of joining the project. The system's prompt and component architecture was subsequently incorporated into multiple Google AI support libraries.