mirror of https://github.com/kjanat/livedash-node.git synced 2026-01-16 20:52:09 +01:00

Files

Kaj Kowalski e2301725a3 feat: complete development environment setup and code quality improvements

- Set up pre-commit hooks with husky and lint-staged for automated code quality
- Improved TypeScript type safety by replacing 'any' types with proper generics
- Fixed markdown linting violations (MD030 spacing) across all documentation
- Fixed compound adjective hyphenation in technical documentation
- Fixed invalid JSON union syntax in API documentation examples
- Automated code formatting and linting on commit
- Enhanced error handling with better type constraints
- Configured biome and markdownlint for consistent code style
- All changes verified with successful production build

2025-07-13 14:44:05 +02:00

3.0 KiB

Raw Blame History

Session Processing with OpenAI

This document explains how the session processing system works in LiveDash-Node.

Overview

The system now includes an automated process for analyzing chat session transcripts using OpenAI's API. This process:

Fetches session data from CSV sources
Only adds new sessions that don't already exist in the database
Processes session transcripts with OpenAI to extract valuable insights
Updates the database with the processed information

How It Works

Session Fetching

The system fetches session data from configured CSV URLs for each company
Unlike the previous implementation, it now only adds sessions that don't already exist in the database
This prevents duplicate sessions and allows for incremental updates

Transcript Processing

For sessions with transcript content that haven't been processed yet, the system calls OpenAI's API
The API analyzes the transcript and extracts the following information:
- Primary language used (ISO 639-1 code)
- Number of messages sent by the user
- Overall sentiment (positive, neutral, negative)
- Whether the conversation was escalated
- Whether HR contact was mentioned or provided
- Best-fitting category for the conversation
- Up to 5 paraphrased questions asked by the user
- A brief summary of the conversation

Scheduling

The system includes two schedulers:

Session Refresh Scheduler: Runs every 15 minutes to fetch new sessions from CSV sources
Session Processing Scheduler: Runs every hour to process unprocessed sessions with OpenAI

Database Schema

The Session model has been updated with new fields to store the processed data:

processed: Boolean flag indicating whether the session has been processed
sentimentCategory: String value ("positive", "neutral", "negative") from OpenAI
questions: JSON array of questions asked by the user
summary: Brief summary of the conversation

Configuration

OpenAI API Key

To use the session processing feature, you need to add your OpenAI API key to the .env.local file:

OPENAI_API_KEY=your_api_key_here

Running with Schedulers

To run the application with schedulers enabled:

Development: npm run dev
Development (with schedulers disabled): npm run dev:no-schedulers
Production: npm run start

Note: These commands will start a custom Next.js server with the schedulers enabled. You'll need to have an OpenAI API key set in your .env.local file for the session processing to work.

Manual Processing

You can also manually process sessions by running the script:

node scripts/process_sessions.mjs

This will process all unprocessed sessions that have transcript content.

Customization

The processing logic can be customized by modifying:

lib/processingScheduler.ts: Contains the OpenAI processing logic
scripts/process_sessions.ts: Standalone script for manual processing

3.0 KiB Raw Blame History