Memory Space Discovery
Guide the user through understanding their memory space needs. Don't proceed to generation until you have confidence across all dimensions.
Memory spaces store KNOWLEDGE, not files.
A memory space captures what you've learned, decided, and understood—synthesized from experience. Raw files (PDFs, receipts, screenshots) go in assets/. The main space holds the distilled knowledge: summaries, decisions, insights, learnings.
Think: "What would I want an AI to remember about this area of my life?"
IMPORTANT: Discovery is about STRUCTURE, not DATA.
You're trying to understand the shape of the space—what to remember and how it evolves. You are NOT collecting the user's actual information yet.
Before asking a question, ask yourself: "Will the answer change the folder/file structure?"
- If NO → Don't ask it. It's data collection, not structure discovery.
- If YES → Ask it.
Examples:
| Question | Changes structure? | Ask it? |
|---|---|---|
| "Do you need separate folders per country?" | Yes - adds subdirectories | Yes |
| "Will you have multiple income sources?" | No - still goes in income/ | No |
| "Do documents move through stages?" | Yes - affects lifecycle/folders | Yes |
| "What's your filing status?" | No - just data | No |
| "Track deductions throughout year or just at filing?" | Yes - single file vs collection | Yes |
| "How many employers do you have?" | No - just quantity | No |
Discovery Dimensions
1. Purpose (Start Here)
What problem does this space solve? Get a crisp one-sentence answer.
Good purposes are bounded:
- "Track my blog from ideas through publication"
- "Manage my therapy journey—sessions, insights, homework"
- "Remember everything relevant to my taxes—filings, deductions, decisions"
Bad purposes are vague:
- "Keep track of stuff" → Ask: "What kind of stuff? Give me an example."
- "Everything about my life" → Push back: "That's too broad. What's the most pressing area?"
2. Entities (What Gets Tracked)
What are the "things" in this space? These become files or collections.
Ask: "If this space were a database, what would the tables be?"
Examples by space type:
- Blog: ideas, drafts, published posts, style notes
- Therapy: sessions, insights, homework, goals, medication
- Project: tasks, decisions, stakeholders, milestones, risks
- Relationship: interactions, commitments, shared history, boundaries
For each entity, understand:
- Is it singular (one file) or plural (many files)?
- What fields/sections does each entry have?
- How does it relate to other entities?
3. Lifecycle (How Things Move)
Things rarely stay static. Understand the state machine.
Ask: "Walk me through the journey of a [primary entity] from creation to completion."
Common patterns:
- Linear: idea → draft → review → published → archived
- Cyclical: session → reflection → insight → (feeds next session)
- Accumulative: entries just pile up (journal, log)
- Branching: single item splits into many (one idea → multiple posts)
For each state transition:
- What triggers the move?
- What happens to the old state? (deleted? archived? transformed?)
- Is there ever going back?
4. Access Patterns (How You'll Query)
How will you ask questions of this data?
Common queries:
- By time: "What happened last month?"
- By status: "What's in draft?"
- By entity: "Everything about [person/topic]"
- By relationship: "What sessions mentioned anxiety?"
This informs structure:
- Time-based queries → date-based file names or folders (YYYY-MM-DD)
- Status-based → status field in frontmatter, or separate folders per status
- Entity-based → one file per entity with slug naming
- Relationship-based → tags or explicit links
5. Scale (How Much, How Fast)
Estimate volume to prevent premature optimization AND avoid future restructuring.
Ask:
- "How many [entities] do you expect after 1 year?"
- "How often will you add new entries?"
Rules of thumb:
- <20 items: Single file with sections is fine
- 20-100 items: Individual files in one folder
- 100+ items: Subdirectories (by year, category, status)
- 1000+ items: Consider if this should be multiple spaces
6. Boundaries (What NOT to Store)
Every space has edges. Define them explicitly to prevent sprawl.
Ask: "What might seem related but actually belongs somewhere else?"
Common boundary issues:
- Blog space: Research notes (separate research space), personal journal (separate)
- Therapy space: General journaling (separate), medical records (too sensitive?)
- Work project: Personal tasks (separate), other team's work (separate)
Also ask: "What would be inappropriate to store here?"
- Sensitive credentials
- Data that changes too fast (live metrics)
- Duplicates of data that lives elsewhere
7. Capture Opportunities (What to Listen For)
What moments in conversation signal data worth saving? These become proactive prompts.
Ask: "When would you want me to offer to save something to this space?"
Common capture triggers by space type:
- Blog: "I have an idea for a post about...", mentions finishing a draft
- Therapy: "I had a session today", describes an insight or breakthrough
- Project: "We decided to...", new risk identified, stakeholder update
- Relationship: Significant interaction happened, commitment made
For each capture opportunity, understand:
- What triggers it? (keywords, context, explicit mention)
- Where does it go? (which location in the schema)
- What fields to ask about? (the key data to capture)
- How to prompt? (natural language offer)
8. Useful Tasks (What AI Can Help With)
What recurring tasks would benefit from AI assistance? These become suggested actions.
Ask: "What would you want me to proactively offer to help with?"
Common task patterns by space type:
- Blog: Pre-publish checklist, idea brainstorm, draft review
- Therapy: Session prep (review recent history), progress summary, pattern analysis
- Project: Stakeholder prep, decision summary, risk review
- Relationship: Pre-meeting review, gift brainstorm, commitment check
For each suggested task, understand:
- What's the task? (clear name and description)
- When to offer it? (time-based, event-based, on request)
- What data does it use? (which locations to read)
Confidence Gate
Before proceeding to generation, you must be able to articulate:
- One-sentence purpose that a stranger would understand
- 3-5 core locations with their types (file vs collection)
- Primary lifecycle showing how main entity moves through states
- Clear boundaries of what this space is NOT for
- 2-3 capture hints showing what triggers data capture
- 2-3 suggested tasks showing how AI can proactively help
If you can't articulate all six, ask more questions.
Probing Questions
When user is vague, try these:
For purpose:
- "If this space magically existed, what would you do with it tomorrow?"
- "What's frustrating you that this space would fix?"
For entities:
- "Show me an example of something you'd put in here."
- "What would the most important file be called?"
For lifecycle:
- "What happens when you're done with a [entity]?"
- "Do things ever come back from 'done'?"
For boundaries:
- "What's similar to this but should stay separate?"
- "What would be the wrong thing to put here?"
For capture opportunities:
- "What would you tell me that should trigger me to offer saving something?"
- "When you mention [entity], what details should I ask about?"
For useful tasks:
- "What recurring task would you love help with?"
- "Before [event], what would you want me to prepare for you?"
Space Type Patterns
Common archetypes to recognize:
| Type | Key Feature | Example Entities |
|---|---|---|
| Creative | Ideas → artifacts | ideas, drafts, published, style |
| Relational | People + interactions | people, interactions, commitments |
| Process | Steps → outcome | stages, tasks, decisions, outcomes |
| Learning | Topics + understanding | topics, notes, questions, insights |
| Administrative | Knowledge + deadlines | filings, deductions, decisions, contacts |
| Therapeutic | Sessions + growth | sessions, insights, goals, progress |
Use these patterns to suggest structure, but don't force fit. User's actual needs take precedence.