shubho-learning-bonkers

Shared from "Bonkers" on Inkdown

Bonkers — Interview Revision Sheet

Last updated: June 11, 2026 · Final pass — principal engineer review Use: Read top-to-bottom once, then §0 Cheat Sheet + §12 Scripts before interview Mental model: Bonkers = product · Wallflower = backend engine · Your work = v2→v3 rebuild

0. Cheat Sheet (60-second scan)

vs	Bonkers wins because
ChatGPT/Gemini	Image IS the product — 8 edit features, gallery, templates
Midjourney	Multi-provider web app, not Discord-locked
Ideogram/Flux playgrounds	Abstract model IDs + cross-provider fallback
Canva/Firefly	AI-native; templates = generation configs, not static files

Dimension	v2	v3 (your rebuild)
Core action	Text→image once	Template → generate → edit → remix → repeat
Pipeline	Separate controllers per feature	Unified `unified-generation.controller.ts`
Input	Text only	Text + image upload + mask
Models	Limited	20+ models, abstract IDs
Templates	None / saved prompts	First-class Firestore configs
Performance	Serial	Parallel post-processing, smart routing (-30%)
Retention	Low	Repeat workflows (+50% DAU)
Routes	`/{lang}/old-bonkers` · `/image-generation`	`/{lang}/bonkers` · `/unified-generation`

User-facing	Abstract ID	Resolves to	Query cost
Bonkers Lite	`bonkers-lite`	`prunaai/hidream-l1-fast`	10 / image
Bonkers Advance	`bonkers-advance`	`fal-ai/ideogram/v3`	120 / image
Bonkers Magic Fill	`bonkers-magic-fill`	`fal-ai/ideogram/v3/edit`	(inpaint tier)

#	Feature	Provider / model	Resilience	One-liner
1	GENERATE	Any model	✅ Replicate ↔ FAL fallback	20+ models, auto-failover
2	INPAINT	Mask fill	⚠️ Photon SPOF for Replicate	Cross-provider mask inversion
3	UPSCALE	`fal-ai/clarity-upscaler`	❌ FAL only	One-click resolution boost
4	ERASE	`fal-ai/bria/eraser`	❌ FAL only	AI object removal
5	EDIT_BG	`fal-ai/bria/background/replace`	❌ FAL only	Background swap, no subject refs
6	OMNI_EDIT	`fal-ai/flux-pro/kontext/multi`	❌ FAL only	Multi-image edit
7	TEMPLATE	Per-template provider	⚠️ Provider lock	One-click styled generation
8	REMIX	GPT Image 1	❌ OpenAI only	Image→image (your v3 addition)

Layer	Mechanism	Interview detail
Abstract models	Zod `.transform()`	See §3
Fallback	`FALLBACK_MODELS_MAP` + sequential `callWithFallback()`	GENERATE only; inexact OK (Ideogram V2 → Flux Pro)
Magic prompt	`MAGIC_PROMPT_SYSTEM_PROMPT` + `MAGIC_PROMPT_MODELS`	Per-feature LLM before provider
GCS normalization	`getAttachmentWithGBucketUrl()`	Download temp provider URL → permanent GCS
Moderation	Pre: `checkPromptFlagged()` · Post: `isImageFlagged()` (templates only)	Pre = block-before-spend; fails open on bad JSON
Style system	`PRESET_STYLES_MAP` + `getStyleModifiedPrompt()`	Provider-specific application

Feature	Model	Output
GENERATE	GPT-4o-mini	`{ enhancedPrompt }` ≤1000 chars
INPAINT	GPT-4o	`{ enhancedPrompt, isRemoveOnly }` — "remove car" vs "replace with tree"
EDIT_BG	GPT-4o-mini	Background only — never "the subject" / "the person"
GHIBLI / Product / Logo	GPT-4o-mini	Hyper-detailed style system prompts
OFF	Defaults	e.g. `isRemoveOnly: false`, `imageStrength: 0.8` for GHIBLI

Step	What	Note
1	Zod validation	prompt, modelConfig, feature, style, numImages, `isPublic`
2	Auth + usage confirm	—
3	`checkPromptFlagged()`	Blocking — saves provider $
4	`improvePrompt()` magic prompt	~700 tokens
5	Abstract model `.transform()`	`bonkers-advance` → `fal-ai/ideogram/v3`
6	Prefix dispatch (`helpers/generate.ts`)	See §7
7	`callWithFallback()`	2–30s — slowest step
8	`formatImageGenerations()`	See §8
9	Firestore save	`wallflower/{uid}/images/`
10	SSE events	`attachments` + `usage`
11	`incrementUserUsage()`	Charge abstract model query cost

Model prefix	Handler	Try first
`black-forest-labs/`, `recraft-ai/`, `ideogram-ai/*`	`handleReplicateWithFalAIFallback()`	Replicate
`fal-ai/*`	`handleFalWithReplicateFallback()`	FAL
`gpt-image-1-*`	OpenAI handler	OpenAI
`gemini-*`	Google Vertex	Google
`midjourney-*`	GoAPI handler	GoAPI

Layer	Detail
Cloud Run	0→N autoscale · stateless · 6GB RAM · 60min timeout · `us-west1` · shares container with chat
Providers	They scale GPUs — Bonkers = orchestrator
Firestore	Image subcollections scale; `customers/{uid}` = 1 write/sec bottleneck
Redis	`rate-limiter-flexible` · IRC pub/sub · `STOP_GENERATING` TTL cache

Bottleneck	Fix
Firestore usage counter	Redis-sharded distributed counters
SEO blocking	Async via Cloud Tasks
Gallery SSR	ISR + on-demand revalidation
Photon SPOF	Inline JS + circuit breaker
FAL outage	5/8 features down (no fallback)
No decision logging	Sentry breadcrumbs per pipeline step

Feature key	Who consumes
`bonkers`	BONKERS_PRO / BONKERS_BASIC subscribers
`merlin`	Free/PRO/TEAMS/ELITE using image gen via chat — 1 FLUX Pro image (140 queries) can exhaust daily chat quota
`wallflower`	Legacy `/image-generation` endpoint only

#	Gap	Fix
1	4/8 features die on FAL outage	Add redundant providers for edit features
2	GCS orphan on Firestore failure	Write-ahead `pending` status
3	Moderation fails open on bad JSON	try/catch → default `flagged: true`
4	Regenerate inconsistency	Magic prompt temp > 0; fallback changes model; seed not tied to concrete modelId
5	Post-gen moderation templates only	Extend `isImageFlagged()` to all features
6	SEO blocks response 500ms–2s	Async Cloud Tasks
7	No real per-provider $ tracking	`IMAGE_TOKEN_TO_QUERY_RATIO` is fictional
8	Firestore usage hotspot	Redis counter shards
9	ImageKit CDN race	Retry/poll or serve GCS directly until propagated
10	No error SSE event	Add explicit error event type
11	No pipeline decision logging	Sentry breadcrumbs (chat has `decisionLog`, images don't)

They ask	You say
Why not WebSockets?	Sticky sessions on Cloud Run; SSE = HTTP, autoscale-friendly
Why `@microsoft/fetch-event-source`?	EventSource can't POST or send Firebase JWT
50K users?	No — 50K generations/month, ~1.7K/day
Scalable?	Yes today; bottlenecks at 10× with known fixes
Promise.allSettled speed up gen?	No — speeds post-processing SEO/GCS after provider returns
What if FAL goes down?	GENERATE degrades via fallback; 4 edit features + templates on FAL-only models break
Regenerate gives different image?	Magic prompt non-determinism + fallback may switch provider; seed tied to model
ImageKit 404 on fresh upload?	CDN propagation race 0–5s — image exists in GCS
Free users?	Blocked at `usageLimits` — never reach controller
Chat tool vs direct Bonkers?	`isToolCall` skips SSE/GCS; same engine underneath
Templates vs saved prompts?	Firestore configs with versioned model+prompt — product feature, not user saves
NDA / no code?	Walk architecture and data flow; reference codenames and patterns not proprietary keys

Path	Role
`endpoints/wallflower/unified-generation.controller.ts`	Central orchestrator
`unified-generation.schema.ts`	Zod + abstract model `.transform()`
`helpers/generate.ts`	Prefix routing + `callWithFallback()`
`repositories/streamer/`	SSE server engine
`middlewares/usageLimits/`	Quota gate
`utilities/usage.ts`	`incrementUserUsage()`
Provider helpers	`replicate/` · `fal-ai/` · `ideogram/` · `openai-image-gen.ts` · `google-image-gen/`

Product	AI creative production — generate, edit, template, gallery (not chat-with-images)
Company / URL	Foyer · getmerlin.in
Scale	~50K generations/month (~1.7K/day, ~70/hr) — NOT 50K users
Impact	+50% DAU · -30% latency · 10K template images month 1 (~20% of volume)
Stack	`bonkers/` frontend · `merlin-arcane/` backend · `/v1/wallflower/`
Controller	`unified-generation.controller.ts`
Storage	GCS `wallflower-images/{uid}/{iid}.png` · Firestore `wallflower/{uid}/images/`
Providers	5: FAL · Replicate · OpenAI · Google Vertex · GoAPI
Models / Features / Templates	20+ / 8 / 9
Killer differentiator	Abstract model mapping — Zod `.transform()`, zero frontend changes on provider swap
One sentence	Cross-provider image engine — 5 providers, unified API, sequential failover, moderation, GCS normalization

shubho-learning-bonkers

Bonkers — Interview Revision Sheet

0. Cheat Sheet (60-second scan)

shubho-learning-bonkers

Bonkers — Interview Revision Sheet

0. Cheat Sheet (60-second scan)

1. What & Why

2. Your Contribution — v2 → v3

3. Abstract Model Mapping (core pattern)

4. Eight Features + Nine Templates

5. Six Platform Layers

6. Request Flow — Middleware + 11 Steps + SSE

7. Smart Routing & Fallback

8. Post-Processing & Data Model

9. Architecture & Scaling

10. Quota & Auth (one section, minimal)

11. Known Gaps + Fixes (say proactively)

12. Interview Scripts

13. Probe Q&A (quick follow-ups)

14. Key Files

15. Skip (Merlin ecosystem, not Bonkers engine)