Scientific Methodology

Engine Cards

Transparency about what ORAVYS can and cannot do. Each engine category is documented with its capabilities, limitations, scientific basis, and known biases.

Transparency

Every metric includes a confidence score. We disclose limitations openly.

Bias Awareness

We actively test for and mitigate biases related to accent, language, gender, and age.

Peer-Reviewed

Our methods build on published research in bio-acoustics and speech science.

Privacy First

Audio is never stored beyond analysis. No voice fingerprinting. No tracking.

Forensic Analysis
Cognitive congruence, authenticity verification, stress indicators
28 Engines

What It Analyzes

  • Micro-tremor patterns associated with cognitive load during deception
  • Acted vs. genuine speech classification (trained on CREMA-D + VoxPopuli)
  • Vocal stress indicators (fundamental frequency variation, jitter, shimmer)
  • Temporal consistency and speech rate anomalies

Scientific Basis

  • Ekman & Friesen (1969) - Micro-expression and vocal leakage theory
  • DePaulo et al. (2003) - Cues to deception meta-analysis
  • Kirchhubel & Howard (2013) - Acoustic correlates of deceptive speech

Known Limitations

  • No lie detector is 100% accurate. Results are probabilistic, not deterministic.
  • Cultural differences affect vocal expression patterns significantly.
  • Professional actors or trained individuals may confound detection.
  • Audio quality (background noise, compression) reduces accuracy.
Confidence range: 55-92%
Min audio: 15 seconds
Languages: EN, FR, ES, DE, HE
Personality & Emotional Profile
Big Five traits, emotional intelligence, communication style
16 Engines

What It Analyzes

  • Big Five personality traits inferred from vocal patterns (openness, conscientiousness, extraversion, agreeableness, neuroticism)
  • Emotional valence and arousal from prosodic features
  • Communication style classification (assertive, empathetic, analytical, expressive)
  • 42-emotion intelligence matrix from acoustic biomarkers

Scientific Basis

  • Scherer (2003) - Vocal communication of emotion
  • Mohammadi & Vinciarelli (2012) - Automatic personality perception from speech
  • Mairesse et al. (2007) - Using linguistic cues for personality recognition

Known Limitations

  • Personality inference from voice is probabilistic; self-report questionnaires remain the gold standard.
  • Emotional state at recording time affects trait inference.
  • Accents and dialects may influence perceived traits.
Confidence range: 60-88%
Min audio: 30 seconds
Optimal: 2-5 minutes of natural speech
Clinical Indicators
Stress biomarkers, cognitive load, vocal health
Research Stage

What It Analyzes

  • Vocal stress biomarkers (cortisol-correlated frequency patterns)
  • Cognitive load estimation from speech disfluencies and timing
  • Vocal fatigue and strain indicators
  • Respiratory pattern analysis from speech segments

Scientific Basis

  • Titze (1994) - Principles of Voice Production
  • Cummins et al. (2015) - Speech analysis for health applications
  • Low et al. (2020) - Automated assessment of psychiatric disorders using speech

Known Limitations

  • NOT a medical device. Results do not constitute medical diagnosis.
  • Clinical indicators are correlational, not causational.
  • Environmental factors (noise, microphone quality) affect readings.
  • Individual baseline variation is significant; longitudinal tracking improves accuracy.
Confidence range: 50-80%
Status: Research / informational only
Disclaimer: Not FDA/CE approved
Professional & Communication
Leadership voice, negotiation style, presentation impact
5 Engines

What It Analyzes

  • Authority and confidence indicators in speech patterns
  • Persuasion effectiveness metrics (pace, emphasis, pause strategy)
  • Communication clarity and articulation scoring
  • Vocal presence and projection assessment

Known Limitations

  • Professional voice assessment is context-dependent; a courtroom voice differs from a coaching voice.
  • Cultural norms affect what constitutes "authoritative" or "persuasive" speech.
  • Should be used as coaching feedback, not employment screening.
Confidence range: 65-90%
Best for: Coaching, self-improvement, presentation prep
Core Voice Analysis
Fundamental acoustic features, voice quality, temporal patterns
21 Engines

What It Analyzes

  • Fundamental frequency (F0), formants, harmonics-to-noise ratio
  • MFCC features (40 coefficients + deltas) for spectral characterization
  • Temporal features: speech rate, pause patterns, rhythm
  • Voice quality: breathiness, roughness, strain, vocal age estimation

Technical Details

  • Audio input: WAV/MP3/WebM, 8kHz-48kHz sampling rate
  • Feature extraction: librosa, praat-parselmouth
  • ML architecture: BiLSTM + Transformer attention (custom ORAVYS models)
  • Inference: ONNX Runtime for CPU-optimized real-time analysis
Processing time: ~3-8 seconds
Min quality: 8kHz, 16-bit
Optimal: 16kHz+, quiet environment

How ORAVYS Works

A three-stage pipeline from audio input to intelligence output.

1

Capture

Audio recorded in-browser (WebRTC) or uploaded. Encrypted in transit (TLS 1.3).

2

Extract

27+ engines extract 200+ acoustic features in parallel. Processing: 3-8 seconds.

3

Synthesize

Results aggregated into a unified report with confidence scores and insights.

GUEST