107 analysis engines for voice & speech research. Built for phonetics, prosody, and computational paralinguistics. REST API with JSON output, batch support, and full reproducibility.
One endpoint. One audio file. 107 engines run in parallel. Results in JSON with sub-second latency on standard research hardware.
curl -X POST https://api.oravys.com/v1/analyze \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: multipart/form-data" \ -F "audio=@sample.wav" \ -F "engines=all" \ -F "format=json"
{ "status": "success", "engine_count": 107, "feature_count": 137, "duration_ms": 843, "authenticity_score": 0.9312, "emotion_profile": { "dominant": "neutral", "valence": 0.62, "arousal": 0.31, "confidence": 0.88 }, "features": { "f0_mean": 182.4, // Hz "f0_std": 28.7, // Hz "jitter_local": 0.0041, // ratio "shimmer_local": 0.071, // dB "hnr": 18.3, // dB "cpp": 12.1, // dB "mfcc": [-312.4, 82.1, -14.2, "..."], "formants": { "F1": 731, "F2": 1182, "F3": 2638 } }, "engines": { "DeepfakeDetector": { "score": 0.04, "label": "genuine" }, "EmotionEngine": { "dominant": "neutral", "valence": 0.62 }, "PersonalityEngine": { "openness": 0.71, "conscientiousness": 0.64 }, "...": "104 more engines" }, "model": { "version": "V4", "f1_score": 0.9996, "parameters": "1.1M", "arch": "BiLSTM+TransformerEncoder+AttentionPool" } }
Each engine is independently registered via @register_engine,
runs in parallel, and returns a typed JSON object.
Hover any card to see engine names.
Foundational authenticity detection and deepfake classification. Primary BiLSTM+Transformer pipeline with GroupSplit certification.
Audio forensics, manipulation detection, compression artifact identification, and chain-of-custody integrity analysis.
Vocal biomarkers for health research. Stress physiology, dysphonia markers, and vocal fatigue indices. EU AI Act Article 50 compliant.
Big Five, HEXACO, and MBTI acoustic correlates. Prosody-to-trait mapping from peer-reviewed models.
Workplace communication quality, leadership acoustics, and persuasion dynamics for organizational research.
Acoustic quality metrics: HNR, jitter, shimmer, CPP, GNE. Full phonatory analysis compatible with Praat exports.
Time-series analysis of vocal changes: longitudinal tracking, spectral trajectory, and rhythm dynamics.
Dyadic interaction patterns, entrainment, and conversation dynamics for social speech research.
Every analysis computes the full 137-dimensional feature vector. Grouped below by perceptual category with Praat-compatible naming.
Every engine references validated literature. Designed for reproducibility and publication-ready output.
@software{oravys2026, author = {ORAVYS Ltd.}, title = {ORAVYS: 107-Engine Voice Analysis API for Speech Research}, year = {2026}, version = {V4.0}, url = {https://app.oravys.com/academic}, note = {BiLSTM+TransformerEncoder, F1=0.9996, 137 acoustic features, 9 domains} }
ORAVYS is classified as a research tool, not a lie detector. All clinical outputs are labeled as biomarkers requiring qualified interpretation. Lawful use in EU academic contexts confirmed.
All tiers include full access to all 107 engines, JSON export, and batch processing. Upgrade or cancel at any time.
OpenAPI 3.1 spec, interactive playground, and annotated examples for every endpoint.
API key management, Bearer token format, key rotation, and rate-limit headers explained.
View guide
Full reference for /v1/analyze,
/v1/batch,
/v1/features, and
/v1/engines.
Async callbacks for batch jobs. HMAC-signed payloads, retry policy, and dead-letter queue configuration.
View guide
Complete error taxonomy: HTTP status codes, ORAVYS error
enums, and machine-readable error.type fields.
Get your free API key and run your first analysis in under 2 minutes.