mirror of https://github.com/kjanat/livedash-node.git synced 2026-01-16 21:12:08 +01:00

Files

Kaj Kowalski e2301725a3 feat: complete development environment setup and code quality improvements

- Set up pre-commit hooks with husky and lint-staged for automated code quality
- Improved TypeScript type safety by replacing 'any' types with proper generics
- Fixed markdown linting violations (MD030 spacing) across all documentation
- Fixed compound adjective hyphenation in technical documentation
- Fixed invalid JSON union syntax in API documentation examples
- Automated code formatting and linting on commit
- Enhanced error handling with better type constraints
- Configured biome and markdownlint for consistent code style
- All changes verified with successful production build

2025-07-13 14:44:05 +02:00

4.7 KiB

Raw Blame History

Scheduler Workflow Documentation

Overview

The LiveDash system has two main schedulers that work together to fetch and process session data:

Session Refresh Scheduler - Fetches new sessions from CSV files
Processing Scheduler - Processes session transcripts with AI

Current Status (as of latest check)

Total sessions: 107
Processed sessions: 0
Sessions with transcript: 0
Ready for processing: 0

How the `processed` Field Works

The ProcessingScheduler picks up sessions where processed is NOT true, which includes:

processed = false
processed = null

Query used:

{ processed: { not: true } } // Either false or null

Complete Workflow

Step 1: Session Refresh (CSV Fetching)

What it does:

Fetches session data from company CSV URLs
Creates session records in database with basic metadata
Sets transcriptContent = null initially
Sets processed = null initially

Runs: Every 30 minutes (cron: */30 * * * *)

Step 2: Transcript Fetching

What it does:

Downloads full transcript content for sessions
Updates transcriptContent field with actual conversation data
Sessions remain processed = null until AI processing

Runs: As part of session refresh process

Step 3: AI Processing

What it does:

Finds sessions with transcript content where processed != true
Sends transcripts to OpenAI for analysis
Extracts: sentiment, category, questions, summary, etc.
Updates session with processed data
Sets processed = true

Runs: Every hour (cron: 0 * * * *)

Manual Trigger Commands

Check Current Status

node scripts/manual-triggers.js status

Trigger Session Refresh (Fetch new sessions from CSV)

node scripts/manual-triggers.js refresh

Trigger AI Processing (Process unprocessed sessions)

node scripts/manual-triggers.js process

Run Both Schedulers

node scripts/manual-triggers.js both

Troubleshooting

No Sessions Being Processed?

Check if sessions have transcripts:

node scripts/manual-triggers.js status

If "Sessions with transcript" is 0:

Sessions exist but transcripts haven't been fetched yet
Run session refresh: node scripts/manual-triggers.js refresh

If "Ready for processing" is 0 but "Sessions with transcript" > 0:

All sessions with transcripts have already been processed
Check if OPENAI_API_KEY is set in environment

Common Issues

"No sessions found requiring processing"

All sessions with transcripts have been processed (processed = true)
Or no sessions have transcript content yet

"OPENAI_API_KEY environment variable is not set"

Add OpenAI API key to .env.development file
Restart the application

"Error fetching transcript: Unauthorized"

CSV credentials are incorrect or expired
Check company CSV username/password in database

Database Field Mapping

Before AI Processing

{
  id: "session-uuid",
  transcriptContent: "full conversation text" | null,
  processed: null,
  sentimentCategory: null,
  questions: null,
  summary: null,
  // ... other fields
}

After AI Processing

{
  id: "session-uuid", 
  transcriptContent: "full conversation text",
  processed: true,
  sentimentCategory: "positive" | "neutral" | "negative",
  questions: '["question 1", "question 2"]', // JSON string
  summary: "Brief conversation summary",
  language: "en", // ISO 639-1 code
  messagesSent: 5,
  sentiment: 0.8, // Float value (-1 to 1)
  escalated: false,
  forwardedHr: false,
  category: "Schedule & Hours",
  // ... other fields
}

Scheduler Configuration

Session Refresh Scheduler

File: lib/scheduler.js
Frequency: Every 30 minutes
Cron: */30 * * * *

Processing Scheduler

File: lib/processingScheduler.js
Frequency: Every hour
Cron: 0 * * * *
Batch size: 10 sessions per run

Environment Variables Required

# Database
DATABASE_URL="postgresql://..."

# OpenAI (for processing)
OPENAI_API_KEY="sk-..."

# NextAuth
NEXTAUTH_SECRET="..."
NEXTAUTH_URL="http://localhost:3000"

Next Steps for Testing

Trigger session refresh to fetch transcripts:

node scripts/manual-triggers.js refresh

Check status to see if transcripts were fetched:

node scripts/manual-triggers.js status

Trigger processing if transcripts are available:

node scripts/manual-triggers.js process

View results in the dashboard session details pages

4.7 KiB Raw Blame History

Scheduler Workflow Documentation

Overview

Current Status (as of latest check)

How the processed Field Works

Complete Workflow

Step 1: Session Refresh (CSV Fetching)

Step 2: Transcript Fetching

Step 3: AI Processing

Manual Trigger Commands

Check Current Status

Trigger Session Refresh (Fetch new sessions from CSV)

Trigger AI Processing (Process unprocessed sessions)

Run Both Schedulers

Troubleshooting

No Sessions Being Processed?

Common Issues

"No sessions found requiring processing"

"OPENAI_API_KEY environment variable is not set"

"Error fetching transcript: Unauthorized"

Database Field Mapping

Before AI Processing

After AI Processing

Scheduler Configuration

Session Refresh Scheduler

Processing Scheduler

Environment Variables Required

Next Steps for Testing

4.7 KiB

Raw Blame History

How the `processed` Field Works