
Streamlit UI Guide

Launch the UI

streamlit run ui_premium.py

The UI will open in your browser at http://localhost:8501.

Overview

The NLP Cyber Incident Triage Laboratory is a Streamlit web application that provides an interactive dashboard for security incident analysis. It combines real-time classification, visual analytics, threat intelligence, and SOC automation in a single user-friendly interface.

Screenshot: UI Overview

Main interface showing the analysis dashboard


🚀 Getting Started

Launch the Application

From the project root directory:

# Activate virtual environment
source .venv/bin/activate

# Launch UI
streamlit run ui_premium.py

The interface will automatically open in your default browser at http://localhost:8501.

First-Time Setup

Before using the UI, ensure you have:

  1. ✅ Installed all dependencies: pip install -e .
  2. ✅ Model files downloaded (run pytest tests/test_model.py -v to trigger automatic download)
  3. ✅ (Optional) LLM model for second opinions: Download Llama-2-7B-Chat GGUF

📋 Interface Modes

The UI offers three primary analysis modes accessible from the sidebar:

๐Ÿ” Single Incident Analysis

Analyze individual security incidents with comprehensive intelligence.

Screenshot: Single Incident Mode

Single incident analysis with real-time classification

Features:

  • Real-time incident classification
  • Confidence scoring with visual gauge
  • Probability distribution charts
  • MITRE ATT&CK technique mapping
  • Threat intelligence panel
  • SOC playbook recommendations
  • Risk radar visualization

Workflow:

  1. Enter incident description in the text area
  2. Configure analysis settings in sidebar (threshold, difficulty, LLM)
  3. Click "🔍 Analyze Incident"
  4. Explore results across five analysis tabs

📊 Bulk Analysis Intelligence Center

Process multiple incidents from uploaded files with advanced analytics.

Screenshot: Bulk Analysis Mode

Bulk processing dashboard with aggregate metrics

Features:

  • CSV/TXT file upload support
  • Batch processing with progress tracking
  • Aggregate statistics and metrics
  • LLM upgrade tracking (shows when AI changes classifications)
  • Interactive filtering by label, confidence, uncertainty
  • Export results as CSV/JSON
  • Comprehensive threat intelligence briefs

Workflow:

  1. Upload incidents file (CSV with description column or TXT with one incident per line)
  2. Configure batch processing settings
  3. Monitor real-time progress
  4. Review aggregate analytics
  5. Filter and export results

🧪 Experimental Lab

Advanced features for research and experimentation.

Screenshot: Experimental Lab

Experimental analysis tools

Features:

  • Text similarity analysis
  • Incident clustering
  • Model performance comparison
  • Synthetic data generation
  • Advanced feature extraction
  • IOC lookup and threat feeds

⚙️ Sidebar Settings

The sidebar provides comprehensive control over analysis parameters:

Screenshot: Sidebar Settings

Configuration panel with all analysis settings

Analysis Settings

Difficulty Mode

  • default - Standard thresholds (50% confidence)
  • soc-medium - Moderate strictness (60% confidence)
  • soc-hard - Maximum strictness (75% confidence)

Confidence Threshold

  • Slider: 0.0 to 1.0
  • Default: 0.50
  • Controls when predictions are marked "uncertain"
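The thresholding behavior described above can be sketched in a few lines of plain Python. The function and dictionary below are illustrative only (the real logic lives inside the triage package); the 0.50/0.60/0.75 values mirror the difficulty modes listed earlier.

```python
# Illustrative sketch: how a confidence threshold gates predictions.
# THRESHOLDS mirrors the documented difficulty modes; names are hypothetical.
THRESHOLDS = {"default": 0.50, "soc-medium": 0.60, "soc-hard": 0.75}

def apply_threshold(label: str, confidence: float, mode: str = "default") -> dict:
    """Mark a prediction as uncertain when confidence falls below the mode's threshold."""
    uncertain = confidence < THRESHOLDS[mode]
    return {
        "label": label,
        "confidence": confidence,
        "uncertain": uncertain,
        # Uncertain cases are the ones the LLM second opinion would review.
        "display_label": "uncertain" if uncertain else label,
    }
```

For example, a 0.55-confidence phishing call passes in default mode but is flagged under soc-hard.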

Max Classes

  • Number of top predictions to display
  • Range: 1-7
  • Useful for exploring runner-up classifications

LLM Configuration

Enable LLM Second Opinion

  • Toggle AI-assisted classification
  • Engages automatically for uncertain cases
  • Provides alternative perspective with rationale

Debug Mode

  • Shows detailed LLM prompts and responses
  • Useful for troubleshooting
  • Performance analysis

Visualization Options

Advanced Visualizations

  • Enable enhanced charts and graphs
  • Risk radar charts
  • Confidence distributions
  • MITRE technique heatmaps

📊 Analysis Tabs (Single Incident)

Tab 1: 🎯 Analysis

Core classification results with key metrics.

Screenshot: Analysis Tab

Main analysis results with confidence metrics

Displays:

  • Final classification label
  • Confidence score with gauge visualization
  • Uncertainty level indicator
  • Top-N probability distribution
  • Class probability pie chart

Tab 2: 📊 Visualizations

Interactive charts and visual analytics.

Screenshot: Visualizations Tab

Visual analytics dashboard

Charts:

  • Confidence gauge (speedometer-style)
  • Probability distribution (pie chart)
  • Risk radar (multi-dimensional assessment)
  • Text complexity metrics

Tab 3: ๐Ÿ•ต๏ธ Threat Intel

Comprehensive threat intelligence analysis.

Screenshot: Threat Intel Tab

Threat intelligence panel with IOC extraction

Features:

  • MITRE ATT&CK technique mapping
  • IOC extraction (IPs, URLs, emails)
  • Attack sophistication scoring
  • Threat landscape context
  • Related campaigns/TTPs
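IOC extraction of the kind listed above usually comes down to pattern matching over the incident text. A minimal regex-based sketch (illustrative; the app's extractor may use different or stricter patterns):

```python
import re

# Simplified IOC patterns -- illustrative, not the app's production regexes.
IOC_PATTERNS = {
    "ips": r"\b(?:\d{1,3}\.){3}\d{1,3}\b",
    "urls": r"https?://[^\s\"']+",
    "emails": r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b",
}

def extract_iocs(text: str) -> dict:
    """Return IPs, URLs, and email addresses found in free-text incident descriptions."""
    return {kind: re.findall(pattern, text) for kind, pattern in IOC_PATTERNS.items()}
```

Real extractors typically add defanged-indicator handling (hxxp://, [.]) and validation of octet ranges.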

Tab 4: 📋 SOC Playbook

Context-aware incident response recommendations.

Screenshot: Playbook Tab

SOC playbook with actionable recommendations

Provides:

  • Incident priority (P1-P5)
  • Response timeline
  • Step-by-step actions
  • Context-specific guidance
  • Escalation paths
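One simple way a P1-P5 priority could be derived is a label-and-confidence lookup. The mapping below is purely hypothetical (these labels and base priorities are not taken from the app); it only illustrates the shape of context-aware prioritization:

```python
# Hypothetical label -> base priority table (P1 = most urgent).
# The actual playbook logic in the app may differ entirely.
BASE_PRIORITY = {
    "ransomware": 1,
    "data_exfiltration": 2,
    "phishing": 3,
    "policy_violation": 5,
}

def incident_priority(label: str, confidence: float) -> str:
    """Escalate one level when the classifier is highly confident; default to P4."""
    base = BASE_PRIORITY.get(label, 4)
    if confidence >= 0.9 and base > 1:
        base -= 1
    return f"P{base}"
```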

Tab 5: 🔧 Technical Details

Raw data and technical information.

Screenshot: Technical Details Tab

Technical debugging and raw JSON output

Includes:

  • Full JSON response
  • Text complexity analysis
  • Model metadata
  • Debug information
  • LLM prompts/responses (if enabled)

📈 Bulk Analysis Features

Upload & Processing

Supported Formats:

CSV:

description
"User reported suspicious email with attachment"
"Multiple failed login attempts from Asia"
"Unusual outbound traffic to 192.168.1.100"

TXT (one incident per line):

User reported suspicious email with attachment
Multiple failed login attempts from Asia
Unusual outbound traffic to 192.168.1.100
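The two formats above can be handled by one small loader. This is an illustrative sketch (the function name is hypothetical, not the app's API): CSV files must carry a description column, and anything else is treated as plain text with one incident per non-empty line.

```python
import csv
import io

def load_incidents(filename: str, content: str) -> list[str]:
    """Parse a bulk-upload file into a list of incident descriptions."""
    if filename.lower().endswith(".csv"):
        reader = csv.DictReader(io.StringIO(content))
        if "description" not in (reader.fieldnames or []):
            raise ValueError("CSV must contain a 'description' column")
        return [row["description"].strip() for row in reader
                if (row["description"] or "").strip()]
    # TXT fallback: one incident per line, skipping blanks.
    return [line.strip() for line in content.splitlines() if line.strip()]
```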

Processing:

  • Real-time progress bar
  • Estimated time remaining
  • Incident counter
  • Error handling with retry logic
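The processing loop with retry logic can be sketched framework-free. The names here are illustrative (the UI's actual loop also drives a Streamlit progress bar); `classify` stands in for whatever callable performs the classification:

```python
import time

def process_batch(incidents, classify, retries=2, delay=0.0):
    """Classify each incident, retrying transient failures up to `retries` times.

    Returns (results, errors) where errors holds (index, message) pairs for
    incidents that failed every attempt. Illustrative sketch only.
    """
    results, errors = [], []
    for idx, text in enumerate(incidents):
        for attempt in range(retries + 1):
            try:
                results.append(classify(text))
                break
            except Exception as exc:
                if attempt == retries:
                    errors.append((idx, str(exc)))  # give up on this incident
                elif delay:
                    time.sleep(delay)  # optional backoff between attempts
    return results, errors
```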

Screenshot: Bulk Upload

File upload and processing interface

Results Dashboard

After processing completes, view comprehensive analytics:

Screenshot: Bulk Results

Aggregate results with filtering and export

Metrics:

  • Total incidents processed
  • Label distribution
  • Average confidence
  • LLM-resolved count (when second opinion used)
  • Uncertain case count

Interactive Table:

  • Sortable columns
  • Filterable by label, confidence, uncertainty
  • Expandable rows for full incident text
  • Color-coded by risk level

Advanced Analytics

Access four analytics dashboards:

📈 Overview

  • Label distribution pie chart
  • Confidence histogram
  • Timeline analysis
  • MITRE technique frequency

🎯 Confidence Analysis

  • Confidence vs label scatter plot
  • Uncertainty distribution
  • High/low confidence breakdown
  • Threshold impact analysis

⚡ Performance

  • Processing speed metrics
  • Model inference time
  • LLM overhead analysis
  • Resource utilization

🔬 Deep Dive

  • Text complexity analysis
  • N-gram frequency
  • IOC extraction summary
  • Correlation matrices

Screenshot: Advanced Analytics

Advanced analytics with multiple visualization panels

Export Options

Download results in multiple formats:

CSV Export:

description,label,confidence,display_label,llm_override,mitre_techniques
"...",phishing,0.87,phishing,No,"T1566.001,T1204.002"

JSON Export:

{
  "description": "...",
  "label": "phishing",
  "confidence": 0.87,
  "display_label": "phishing",
  "llm_second_opinion": {...},
  "probabilities": {...},
  "mitre_techniques": ["T1566.001"]
}
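Producing both formats from the same result rows is straightforward with the standard library. A minimal sketch (the function name is hypothetical; field names mirror the examples above, and nested fields such as probabilities are dropped in CSV mode since CSV is flat):

```python
import csv
import io
import json

def export_results(results: list[dict], fmt: str = "csv") -> str:
    """Serialize per-incident results for download as CSV or JSON."""
    if fmt == "json":
        return json.dumps(results, indent=2)
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["description", "label", "confidence", "display_label"],
        extrasaction="ignore",  # skip nested fields (probabilities, llm_second_opinion)
    )
    writer.writeheader()
    writer.writerows(results)
    return buffer.getvalue()
```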

Threat Intelligence Brief:

  • Executive summary (Markdown/PDF)
  • MITRE coverage report
  • Critical incidents highlight
  • Strategic recommendations

🧪 Experimental Lab Tools

Text Similarity Analysis

Compare incidents and find similar patterns.

Screenshot: Similarity Analysis

Text similarity clustering visualization

Methods:

  • TF-IDF cosine similarity
  • Semantic embeddings
  • Clustering (K-means, DBSCAN)
  • Similarity heatmaps
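The cosine-similarity method can be illustrated without any ML dependencies. This sketch uses raw term counts as a simplified stand-in for the lab's TF-IDF weighting (i.e., it omits the IDF term):

```python
import math
from collections import Counter

def cosine_similarity(doc_a: str, doc_b: str) -> float:
    """Cosine similarity over bag-of-words term counts.

    Simplified stand-in for a TF-IDF pipeline: identical texts score ~1.0,
    texts with no shared terms score 0.0.
    """
    a, b = Counter(doc_a.lower().split()), Counter(doc_b.lower().split())
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0
```

A pairwise matrix of these scores is exactly what the similarity heatmap visualizes.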

Model Comparison

Benchmark different classifiers.

Models:

  • Logistic Regression (baseline)
  • Random Forest
  • Linear SVM
  • Ensemble methods

Metrics:

  • Accuracy comparison
  • Confusion matrices
  • Per-class performance
  • Feature importance

Synthetic Data Generation

Create test datasets on-demand.

Parameters:

  • Incident type
  • Complexity level
  • Batch size
  • Include IOCs/MITRE/timestamps
  • Export format
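A seeded template filler is one way to generate such datasets reproducibly. Everything below is a hypothetical sketch (the lab's generator is richer, covering IOCs, MITRE tags, and timestamps):

```python
import random

# Hypothetical incident templates keyed by type.
TEMPLATES = {
    "phishing": "User {user} reported a suspicious email with attachment {file}",
    "brute_force": "Multiple failed login attempts against {user} from {ip}",
}

def generate_incidents(incident_type: str, count: int, seed: int = 0) -> list[str]:
    """Produce reproducible synthetic incident descriptions for testing."""
    rng = random.Random(seed)  # fixed seed -> identical batches across runs
    return [
        TEMPLATES[incident_type].format(
            user=f"user{rng.randint(1, 99)}",
            file=f"invoice_{rng.randint(100, 999)}.zip",
            ip=f"10.0.{rng.randint(0, 255)}.{rng.randint(1, 254)}",
        )
        for _ in range(count)
    ]
```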

โš™๏ธ Configuration

Environment Variables

The UI respects the same environment variables as the CLI:

# LLM Configuration
export TRIAGE_LLM_MODEL=/path/to/llama-2-7b-chat.Q5_K_S.gguf
export TRIAGE_LLM_DEBUG=1
export NLP_TRIAGE_LLM_TEMPERATURE=0.2
export NLP_TRIAGE_LLM_MAX_TOKENS=512

# Model Paths
export TRIAGE_LLM_CTX=4096

See Configuration Guide for complete settings.
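If you need the same settings inside a Python script, they can be read with `os.environ.get` and fallbacks. The defaults below simply mirror the example values shown above and may not match the app's actual internal defaults:

```python
import os

def llm_config() -> dict:
    """Read the documented LLM environment variables, with fallback defaults."""
    return {
        "model_path": os.environ.get("TRIAGE_LLM_MODEL"),  # None -> LLM features disabled
        "debug": os.environ.get("TRIAGE_LLM_DEBUG") == "1",
        "temperature": float(os.environ.get("NLP_TRIAGE_LLM_TEMPERATURE", "0.2")),
        "max_tokens": int(os.environ.get("NLP_TRIAGE_LLM_MAX_TOKENS", "512")),
        "context_size": int(os.environ.get("TRIAGE_LLM_CTX", "4096")),
    }
```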

Custom Styling

Modify ui_premium.py to customize the interface:

Color Schemes:

/* Located at the top of ui_premium.py */
.metric-card {
    background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
    ...
}

Chart Types:

  • Plotly graph configurations
  • Streamlit theme settings
  • Layout adjustments

💡 Tips & Best Practices

Performance Optimization

✅ Use baseline first - Test without LLM for 10-20x faster processing
✅ Batch processing - Process multiple incidents in bulk mode
✅ Adjust max classes - Limit to 3-5 for faster rendering
✅ Cache results - Export and reload rather than re-analyzing

Accuracy Improvements

✅ Tune thresholds - Lower (0.3-0.4) for coverage, higher (0.6-0.7) for precision
✅ Use difficulty modes - soc-hard for critical infrastructure
✅ Enable LLM selectively - Only for uncertain/high-stakes cases
✅ Review uncertain - Manual analysis for low-confidence predictions

Workflow Recommendations

✅ Single → Bulk - Test single incidents first, then scale to bulk
✅ Export everything - Save results for audit trails
✅ Use playbooks - Follow SOC recommendations systematically
✅ Monitor metrics - Track confidence trends over time

What to Avoid

โŒ Don't trust blindly - Always review uncertain predictions
โŒ Don't over-rely on LLM - It's decision support, not ground truth
โŒ Don't ignore confidence - Low scores = unreliable classifications
โŒ Don't skip validation - Verify results against ground truth when available


🔧 Troubleshooting

Common Issues

"Model files not found"

# Trigger automatic download
pytest tests/test_model.py -v

Slow LLM processing

  • Use quantized models (Q5_K_S recommended)
  • Enable GPU acceleration via llama-cpp-python
  • Reduce context window: export TRIAGE_LLM_CTX=2048
  • Cap response length: lower NLP_TRIAGE_LLM_MAX_TOKENS for shorter, faster generations

CSV upload fails

  • Ensure description column exists
  • Check UTF-8 encoding
  • Remove empty rows
  • Verify proper CSV delimiter (comma)
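The checklist above can be automated with a small pre-flight check before uploading. This is an illustrative sketch (the function name is hypothetical, not part of the app):

```python
import csv
import io

def validate_incident_csv(content: str) -> list[str]:
    """Return a list of problems that commonly break bulk CSV uploads."""
    problems = []
    reader = csv.DictReader(io.StringIO(content))  # assumes comma delimiter
    if not reader.fieldnames or "description" not in reader.fieldnames:
        problems.append("missing 'description' column")
        return problems
    for i, row in enumerate(reader, start=2):  # row 1 is the header
        if not (row.get("description") or "").strip():
            problems.append(f"row {i} has an empty description")
    return problems
```

An empty return value means the file should pass the upload checks.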

UI crashes or freezes

  • Check terminal output for errors
  • Verify sufficient RAM (8GB+ recommended)
  • Close other applications
  • Reduce LLM context window if OOM

Blank visualizations

  • Enable "Advanced Visualizations" in sidebar
  • Check browser console for JavaScript errors
  • Refresh page
  • Try different browser (Chrome/Firefox recommended)

Debug Mode

Enable detailed logging:

export TRIAGE_LLM_DEBUG=1
streamlit run ui_premium.py

Check terminal output for:

  • LLM prompts and responses
  • Model loading status
  • Processing errors
  • Performance metrics


Need help? Open an issue on GitHub or check the FAQ.