
docs/architecture/performance-profiling-2026-03.md

Performance Profiling — March 2026

TL;DR

The platform is slow because of three systemic patterns, not random one-off issues:

  1. N+1 data fetching — Multiple pages fetch a list of IDs, then loop over each ID with individual Redis calls. This turns a 50ms query into a 2–6 second waterfall. Affects: library respondent detail (3.1s TTFB), BVF admin (5.6s TTFB), admin dashboard (7.9s LCP).

  2. Unoptimized heavy assets — Animated GIF thumbnails are 500KB–2MB each, loaded raw without optimization. A grid with 6 videos downloads 3–12MB of GIFs sequentially, taking 15 seconds per image. This is the biggest perceived slowness for library users.

  3. Invisible backend — The video-processor (our slowest service at 61.7s p75) has tracing completely disabled. We can't diagnose why it's slow because it sends zero data to Sentry. We also have zero custom spans anywhere in the codebase — we can see "page took 4s" but not which Redis call, Clerk call, or function caused it.


Bottlenecks — User-Facing First

Ordered by user impact. Each section includes the original finding and the investigation results.

1. Animated GIF thumbnails — 15s per image (biggest user-visible slowness)

Who's affected: Every library user viewing video grids — the most common screen. Ticket: perf-investigate-gif-thumbnails

Root cause: Video-processor generates 500KB–2MB animated GIFs (apps/video-processor/src/operations/thumbnails.ts). The library loads them via raw <img> tags without Next.js Image optimization (packages/app-library/src/shared/AnimatedThumbnail.tsx). The preload logic (packages/app-library/src/shared/extract-preload-thumbnails.ts) actually prefers animated thumbnails, front-loading the heaviest assets.

Investigation findings:

  • The AnimatedThumbnail component already accepts separate staticSrc and animatedSrc props. The fix requires only ~5 lines: add hover state to VideoDashboardGridItem, pass animatedSrc only when hovered.
  • Video-processor already generates static JPEG thumbnails (30–60KB each) for every video. No backend changes needed.
  • Preload logic is a one-line fix: flip animatedThumbnailUrl || thumbnailUrl to thumbnailUrl || animatedThumbnailUrl.
  • Payload reduction: ~95% — grid initial load drops from 3–12MB to 180–360KB.
  • WebP/MP4 alternatives are worth doing as Phase 2 but diminishing returns after the JPEG-default fix.

Fix: Show static JPEG by default (already generated), only load animated GIF on hover. Update preload logic. Effort: XS (1–2 hours).
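The preload flip and hover gating can be sketched as two pure helpers (the helper names are hypothetical; the real change lives in VideoDashboardGridItem and extract-preload-thumbnails.ts):

```typescript
// Sketch only — assumes the staticSrc/animatedSrc split described above.
interface ThumbnailUrls {
  thumbnailUrl?: string;         // static JPEG, ~30-60KB
  animatedThumbnailUrl?: string; // animated GIF, 500KB-2MB
}

// Preload flip: prefer the lightweight static JPEG over the GIF.
function preloadSrc(t: ThumbnailUrls): string | undefined {
  return t.thumbnailUrl || t.animatedThumbnailUrl; // was: animated || static
}

// Hover gating: only hand the <img> the animated source while hovered.
function displaySrc(t: ThumbnailUrls, hovered: boolean): string | undefined {
  return hovered
    ? t.animatedThumbnailUrl || t.thumbnailUrl
    : t.thumbnailUrl || t.animatedThumbnailUrl;
}
```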

Expected improvement: Grid pages with multiple videos drop from 30s+ to under 5s.

2. Hello /start — 2.7s TTFB (every new user's first impression)

Who's affected: Every new user hitting the onboarding flow. Ticket: perf-investigate-hello-start

Root cause (corrected): Originally described as "two independent Clerk API calls running sequentially." Investigation revealed 7 sequential async calls forming a complete waterfall: searchParams → getDraftIdFromCookie() → registry.getDraft() → auth() → clerkClient() → clerk.getOrganization() → registry.getOrganizationProfile().

Investigation findings:

  • Two fully independent branches that run sequentially: (A) cookie-based draft lookup (no auth needed), (B) Clerk auth-based org profile lookup.
  • Inside branch B, after auth() returns orgId, clerk.getOrganization() and registry.getOrganizationProfile() are independent of each other but run serially.
  • Important caveat: For genuinely new users (no Clerk session), auth() returns { userId: null } immediately and getMyOrganizationProfile() early-returns null. The 2.7s TTFB may NOT be caused by these serial calls for new users — it may be Clerk middleware overhead.

Fix: Two changes: (A) Parallelize the page-level branches with Promise.all(), (B) Parallelize Clerk API + KV fetch inside getMyOrganizationProfile(). Effort: XS–S (1 hour).

Expected improvement: ~600–700ms saved for authenticated users (2.7s → ~2.0s). New user improvement needs Sentry spans to confirm bottleneck location.
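A minimal sketch of both changes, with timed stubs standing in for the real Clerk/registry calls (the function names follow the report's description; the stubs and their shapes are hypothetical):

```typescript
// Hypothetical stubs: each stands in for one network call in the waterfall.
const delay = (ms: number) => new Promise<void>((r) => setTimeout(r, ms));

const auth = async () => { await delay(20); return { orgId: "org_1" as string | null }; };
const clerk = { getOrganization: async (id: string) => { await delay(20); return { id }; } };
const registry = { getOrganizationProfile: async (id: string) => { await delay(20); return { orgId: id }; } };
const loadDraftFromCookie = async () => { await delay(20); return { draftId: "d_1" }; };

// Change B: Clerk API and KV fetch run in parallel after auth() resolves.
async function getMyOrganizationProfile() {
  const { orgId } = await auth();
  if (!orgId) return null; // genuinely new users early-return here
  const [org, profile] = await Promise.all([
    clerk.getOrganization(orgId),
    registry.getOrganizationProfile(orgId),
  ]);
  return { org, profile };
}

// Change A: the two independent page-level branches run concurrently.
async function loadStartPageData() {
  const [draft, orgProfile] = await Promise.all([
    loadDraftFromCookie(),      // branch A: cookie-based draft lookup
    getMyOrganizationProfile(), // branch B: auth-based org profile
  ]);
  return { draft, orgProfile };
}
```

With four 20ms calls, the original serial waterfall costs roughly 80ms of stub latency; the parallelized version costs roughly 40ms (branch A overlaps branch B, and the two calls inside branch B overlap each other).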

3. Library respondent detail — 3.1s TTFB (flow manager workflow)

Who's affected: Flow managers reviewing individual respondent submissions — a key workflow screen. Ticket: perf-investigate-library-respondent

Root cause: N+1 inside getAllSessionIds() (packages/registries/src/server/video-processing-registry.ts, line 383). To return session IDs sorted by createdAt, it fetches the full session data for every session in the flow. Each getRespondentData() call makes 2 HGETALL calls — one for session data, one to re-verify the session belongs to the flow (redundant).

Investigation findings:

  • The page already uses Promise.all() for its 5 top-level data calls — the bottleneck is the internal N+1.
  • Total Redis round-trips: 2N + 6 where N = respondents. For 20 respondents: 46 Redis calls.
  • Additional waste: getClient() makes 2 HGETALL calls (one clientExists check + one getAllValues) when 1 suffices.
  • Session index (client__{flowId}__sessions) is re-fetched N+1 times redundantly.

Fix (three tiers):

  1. P0: New getAllSessionIdsSorted method — fetch only createdAt per session (1 HGET vs 2 HGETALL). Saves N calls. Effort: S.
  2. P1: Fix getClient double-fetch (2→1 HGETALL). Effort: XS.
  3. P2: Pre-fetch session index and share across callers. Saves ~N+1 more calls. Effort: M.
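The P0 tier can be sketched as follows (getAllSessionIdsSorted is the report's proposed name; the in-memory store stands in for Redis, where the real implementation would issue 1 single-field HGET per session instead of 2 full HGETALLs):

```typescript
// Hypothetical stand-in for the sessions hash store.
const sessions: Record<string, { createdAt: number }> = {
  s1: { createdAt: 300 },
  s2: { createdAt: 100 },
  s3: { createdAt: 200 },
};

// 1 HGET of only the createdAt field per session.
async function getSessionCreatedAt(id: string): Promise<number> {
  return sessions[id].createdAt;
}

async function getAllSessionIdsSorted(ids: string[]): Promise<string[]> {
  const stamped = await Promise.all(
    ids.map(async (id) => [id, await getSessionCreatedAt(id)] as const),
  );
  return stamped.sort((a, b) => a[1] - b[1]).map(([id]) => id);
}
```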

Expected improvement: Redis round-trips drop from ~46 to ~24 for 20 respondents. Wall-clock savings ~100–200ms. Remaining TTFB dominated by Clerk API calls.

4. Clerk JS bundle — 25s on slow connections (affects all apps)

Who's affected: Every user on a slow connection (mobile, poor wifi). Ticket: perf-clerk-eager-loading-spike (Backlog — spike for later)

Root cause: ClerkProvider wraps the entire app at root layout level. The Clerk JS bundle is eagerly loaded and blocks all rendering. On slow 3G/4G, Sentry shows it taking up to 25.7s.

Status: Parked for later spike. High risk (auth flow changes affect all apps). Needs investigation of Clerk's lazy loading options and careful testing.


Bottlenecks — Internal

5. BVF /admin — 5.6s TTFB

Who's affected: Flow managers using BVF admin (internal). Ticket: perf-investigate-bvf-admin

Root cause (corrected): Originally described as "per-flow fetches are serialized." Investigation confirmed the outer Promise.all() for per-flow getMergedSessionsForFlow calls already exists. The actual bottleneck is the inner N+1: each getAllClientSessions(flowId) fetches session IDs then individually fetches each session's data.

Investigation findings:

  • Total Redis round-trips on cold cache: ~318 Redis calls + 7 Clerk API calls for ~30 flows / ~200 sessions.
  • Two caching layers: getFilteredFlows() cached 1h, getAllFlowsSessions() cached 5min. The 5.6s TTFB is a cold-cache scenario.
  • Flow config fetching has its own N+1: getAllFlows() does 1 + 2N calls, getFlowOrgMapping() duplicates the client index fetch.

Update (2026-03-04): The admin/page.tsx now calls videoFlowActions.getAllSessions() (a single bulk Redis fetch for all raw sessions) and groups results in-memory, eliminating the outermost raw-session N+1. The remaining bottleneck is getMergedSessionsForFlow fanning out per-flow to the processing registry for enrichment.

Remaining fix: Reduce getMergedSessionsForFlow processing registry fan-out. Pagination (10 flows per page) and Redis pipeline support in BaseStorage are still valuable for further improvement.
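The bulk-fetch-then-group shape of the update can be sketched as below: one Redis call returns the flat session list, and grouping happens in memory (the RawSession shape is hypothetical):

```typescript
// Hypothetical minimal session shape; the real objects carry more fields.
interface RawSession { id: string; flowId: string }

// Replaces the per-flow N+1: group one bulk result instead of refetching.
function groupSessionsByFlow(sessions: RawSession[]): Map<string, RawSession[]> {
  const byFlow = new Map<string, RawSession[]>();
  for (const s of sessions) {
    const bucket = byFlow.get(s.flowId) ?? [];
    bucket.push(s);
    byFlow.set(s.flowId, bucket);
  }
  return byFlow;
}
```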

6. Admin dashboard — 7.9s LCP

Who's affected: Every admin user on first page load of the day (internal). Ticket: perf-investigate-admin-dashboard

Root cause (corrected): Investigation found getAllClients() is called TWICE — once via getAllVideoFlows() and once via getGroupedVideoFlows(). This doubles the N+1 cost. The 60s in-memory cache only covers Clerk organizations, NOT the Redis client data.

Investigation findings:

  • Each getAllClients() call: 1 + N Redis round-trips. Called twice = 2 + 2N calls. With 30 flows: 62 Redis round-trips.
  • getAllClients() already uses Promise.all internally (individual fetches are concurrent). The issue is duplication + lack of caching.
  • BaseStorage/StorageDriver have no bulk/pipeline methods. RedisDriver uses ioredis which natively supports pipeline().

Fix (three tiers):

  1. Quick win: Deduplicate getAllClients() — call once, derive grouping in JS. Effort: XS (15 min).
  2. Cache: Add 60s in-memory cache to getAllClients(). Effort: S (30 min).
  3. Pipeline: Add hgetallMulti() to BaseStorage/RedisDriver for true batch operations. Effort: M (2–3 hours).
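Tiers 1 and 2 can be sketched together (helper names are hypothetical): fetch the client list once, memoize it for 60s, and derive both dashboard views from the same array in JS.

```typescript
interface Flow { id: string; clientId: string }
interface Client { id: string; flows: Flow[] }

// Hypothetical stand-in for the expensive 1 + N Redis round-trips.
let redisFetches = 0;
async function fetchAllClientsFromRedis(): Promise<Client[]> {
  redisFetches += 1;
  return [{ id: "c1", flows: [{ id: "f1", clientId: "c1" }, { id: "f2", clientId: "c1" }] }];
}

// Tier 2: a 60s in-memory cache in front of the Redis fetch.
let cache: { at: number; clients: Client[] } | null = null;
const TTL_MS = 60_000;

async function getAllClientsCached(): Promise<Client[]> {
  if (cache && Date.now() - cache.at < TTL_MS) return cache.clients;
  const clients = await fetchAllClientsFromRedis();
  cache = { at: Date.now(), clients };
  return clients;
}

// Tier 1: both views derive from one call instead of two getAllClients() calls.
async function getDashboardData() {
  const clients = await getAllClientsCached();
  const allFlows = clients.flatMap((c) => c.flows);
  const grouped = new Map<string, Flow[]>();
  for (const f of allFlows) {
    const bucket = grouped.get(f.clientId) ?? [];
    bucket.push(f);
    grouped.set(f.clientId, bucket);
  }
  return { allFlows, grouped };
}
```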

The Observability Gap

Ticket (video-processor): perf-investigate-video-processor-tracing
Ticket (custom spans): perf-investigate-custom-sentry-spans

Zero custom Sentry spans exist in the entire codebase. All performance data comes from auto-instrumentation.

Video-processor tracing (investigation complete)

  • setupExpressErrorHandler(app) is already correctly placed in both services. No changes needed.
  • @sentry/node@10.32.1 with OpenTelemetry — enabling tracing auto-instruments Express routes.
  • Performance overhead: negligible (FFmpeg is CPU-bound in a child process, Sentry traces only Node.js event loop).
  • No PII concerns (request bodies not captured by default, only HTTP paths/methods/timing).
  • Estimated additional spans: ~500–1,000/month (0.02% of quota).

Fix: One-line change in two files: tracesSampleRate: 0 → 0.2. Effort: XS (15 min).
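Sketched against the standard Sentry.init() shape (the DSN env var name is illustrative; the sample rate is the report's target):

```typescript
import * as Sentry from "@sentry/node";

Sentry.init({
  dsn: process.env.SENTRY_DSN,
  tracesSampleRate: 0.2, // was 0 (tracing disabled); 0.2 matches the Next.js apps
});
```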

Custom Sentry spans (investigation complete)

Recommended pattern: withRedisSpan() / withAuthSpan() helper functions wrapping Sentry.startSpan().

Key findings:

  • API confirmed: Sentry.startSpan() with onlyIfParent: true is the correct API (@sentry/nextjs@^10.32.1). Zero overhead when no transaction is active.
  • Proxy pattern doesn't work: BaseStorage methods are protected and only called via this.method() from subclasses. A Proxy only intercepts external property access — it would capture zero calls.
  • Instrument 12 BaseStorage methods + 3 OrganizationClient leaf methods (checkOrganizationAuth, getCurrentOrganization, getAllUserMemberships). The verify* methods don't need direct wrapping — Sentry auto-creates parent-child spans via async context.
  • Key redaction: redactKey() helper replaces IDs with placeholders (org:{orgId}, vf:{flowId}) to avoid PII in spans.
  • Naming: op: "db.redis" + name: "{REDIS_CMD} {redacted_key}" for storage; op: "auth.clerk" + name: "clerk.{method}" for auth.
  • Estimated additional spans: ~25K–42K/month (0.7% of quota, bringing total to ~24.7%).

Fix: Create helper functions, wrap 15 methods, add peer dependencies. Effort: S (2–3 hours).


Recommended Priority

Ordered by user impact and effort:

| # | Finding | Ticket | Effort | Quick Win? |
|---|---------|--------|--------|------------|
| 1 | GIF thumbnails (95% payload reduction) | perf-investigate-gif-thumbnails | XS | Yes — ~5 lines |
| 2 | Hello /start parallelization | perf-investigate-hello-start | XS–S | Yes |
| 3 | Library respondent N+1 | perf-investigate-library-respondent | S | Yes (P0 tier) |
| 4 | Video-processor tracing | perf-investigate-video-processor-tracing | XS | Yes — 1-line x2 |
| 5 | Admin dashboard dedup + cache | perf-investigate-admin-dashboard | XS–S | Yes |
| 6 | Custom Sentry spans | perf-investigate-custom-sentry-spans | S | Foundation for future |
| 7 | BVF admin bulk fetch + pagination | perf-investigate-bvf-admin | M | No |
| 8 | Clerk eager loading | perf-clerk-eager-loading-spike | M–L | No — needs spike |

Cross-cutting improvement: Adding hgetallMulti() pipeline support to BaseStorage/RedisDriver would benefit findings #3, #5, #6, and #7. This is the single highest-leverage infrastructure change.
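The cross-cutting method can be sketched as below (hgetallMulti is the proposed name; PipelineClient models the slice of the real ioredis API relied on here, namely pipeline() accepting command arrays and exec() resolving to [error, result] pairs in command order):

```typescript
type HashValue = Record<string, string>;

// Structural stand-in for ioredis: pipeline([...cmds]).exec().
interface PipelineClient {
  pipeline(cmds: string[][]): { exec(): Promise<Array<[Error | null, unknown]>> };
}

// One network round-trip for N HGETALLs instead of N round-trips.
async function hgetallMulti(redis: PipelineClient, keys: string[]): Promise<HashValue[]> {
  if (keys.length === 0) return [];
  const results = await redis.pipeline(keys.map((k) => ["hgetall", k])).exec();
  return results.map(([err, value]) => {
    if (err) throw err;
    return (value ?? {}) as HashValue;
  });
}
```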


Appendix A: Per-App Performance Data

Raw Sentry data tables for reference. Data window: 14 days (ending 2026-03-02), p75 unless noted.

Admin (Sentry project 4510642999197776)

Frontend — Page Loads (p75)

| Page | Duration | LCP | FCP | TTFB | Hits |
|------|----------|-----|-----|------|------|
| / (dashboard) | 7,918ms | 5,048ms | 4,373ms | 65ms | 356 |
| /hello-drafts | 6,556ms | 1,190ms | 1,190ms | 3,929ms | 41 |
| /sign-in | 6,224ms | 5,907ms | 4,092ms | 1,076ms | 106 |
| /video-exporter | 6,204ms | 3,324ms | 3,324ms | 150ms | 6 |
| /flows/:videoFlowId/:respondentId | 4,088ms | 3,908ms | 2,684ms | 135ms | 178 |
| /flows/:videoFlowId/intelligence | 3,411ms | 1,287ms | 1,287ms | 99ms | 58 |
| /flows/:videoFlowId | 2,707ms | 3,659ms | 3,558ms | 183ms | 256 |

Backend — Server Transactions (p75)

| Route | Duration | Hits |
|-------|----------|------|
| POST /flows/[videoFlowId]/[respondentId] | 61,736ms | 35 |
| GET /api/export-raw-clips | 7,602ms | 40 |
| GET / | 1,631ms | 735 |
| GET /sign-in/[[...sign-in]] | 1,390ms | 695 |


Library (Sentry project 4510080388366416)

Frontend — Page Loads (p75)

| Page | Duration | LCP | FCP | TTFB | Hits |
|------|----------|-----|-----|------|------|
| /sign-in | 5,432ms | 5,032ms | 5,426ms | 1,081ms | 105 |
| /dashboard/:videoFlowId/:respondentId | 4,633ms | 4,656ms | 3,304ms | 3,149ms | 30 |
| /switch-org | 4,581ms | 2,388ms | 2,388ms | | 5 |
| / | 4,385ms | 2,640ms | 1,908ms | 670ms | 170 |
| /dashboard/:videoFlowId | 2,945ms | 1,908ms | 1,164ms | 741ms | 50 |

Slowest Spans (7-day window)

| Operation | Description | p75 | Max | Count |
|-----------|-------------|-----|-----|-------|
| default | /api/webhooks/process-video/route | 37,169ms | 39,002ms | 40 |
| http.client | POST video-processor-prod | 30,621ms | 33,735ms | 65 |
| resource.img | Animated GIF from Vercel Blob | 15,080ms | 15,080ms | 5 |


Branded Video Flow (Sentry project 4510080275120208)

| Page | Duration | LCP | FCP | TTFB | Hits |
|------|----------|-----|-----|------|------|
| /admin | 7,652ms | 5,696ms | 5,696ms | 5,610ms | 5 |
| / | 4,657ms | 2,292ms | 2,556ms | 1,229ms | 40 |
| /flows/:flowId | 3,468ms | 1,918ms | 1,792ms | 1,167ms | 535 |
| /flows/:flowId/frame | 1,305ms | 520ms | 484ms | 419ms | 5 |


Hello (Sentry project 4510515885834320)

| Page | Duration | LCP | FCP | TTFB | Hits |
|------|----------|-----|-----|------|------|
| /start | 7,486ms | 5,307ms | 5,702ms | 2,741ms | 110 |
| /flows/vf_ki4j-rSNmDlM | 5,643ms | 3,372ms | 3,372ms | 2,322ms | 5 |
| / | 4,050ms | 3,024ms | 3,218ms | 662ms | 100 |



Appendix B: Sentry Configuration Reference

SDK Config per App

| App | tracesSampleRate | profilesSampleRate | Browser Profiling | Spans (30d) |
|-----|------------------|--------------------|-------------------|-------------|
| library | 0.2 | 0.1 | 10% | 420,405 |
| admin | 0.2 | 0.1 | 10% | 260,949 |
| branded-video-flow | 0.2 | 0.1 | 10% | 266,684 |
| hello | 0.2 | 0.1 | 10% | 214,478 |
| links | 0.2 | 0.1 | 10% | 38,980 |
| demo | 0.1 | | No | 140 |
| video-processor | 0 | 0.1 | N/A | 0 |
| sync-worker | 0 | 0.1 | N/A | 0 |

Total: ~1.2M spans/month of 5M included (24% of quota).

Config File Locations

  • Next.js server: apps/*/instrumentation.ts
  • Next.js client: apps/*/sentry.client.config.ts
  • Express: apps/video-processor/src/index.ts (lines 46–79), apps/sync-worker/src/index.ts (line 23)
  • Shared builders: packages/core/src/server/sentry.ts, packages/core/src/web/sentry.ts

Data Layer (Instrumentation Targets)

All app data flows through a registry pattern. Instrumenting BaseStorage covers everything:

App Page → Registry (e.g., VideoProcessingRegistry)
               → BaseStorage (12 methods)  ← instrument here
                    → RedisDriver (11 methods)
                         → ioredis → Redis

BaseStorage (packages/storage/src/base-storage.ts) — 12 methods:

  • Hash: getValue, setValue, getAllValues, deleteValue
  • String: getSimpleValue, setSimpleValue, setSimpleValueWithTTL, deleteSimpleValue
  • Sorted set: sortedSetAdd, sortedSetRevRange, sortedSetRemove
  • Scan: scanSimpleKeys

OrganizationClient (packages/auth/src/server/organization-client.ts) — 8 methods (3 leaf methods to instrument):

  • Instrument: checkOrganizationAuth, getCurrentOrganization, getAllUserMemberships
  • Skip (delegate to above): verifySignedInUser, verifySignedInOrganization, verifyOrganisationAdmin, checkUserMembership, checkOrganization (sync)



Related Documents