Metrics Explained

Performance Metrics

Speed Metrics

  • Extraction Time: Wall-clock time for complete extraction (seconds)
  • Files per Second: Throughput measurement for batch processing
  • Time per MB: Extraction time normalized by file size (seconds per MB)
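
As a rough illustration of how the three speed metrics relate, the sketch below derives them from raw measurements. The names `SpeedMetrics` and `speed_metrics` are hypothetical, and treating 1 MB as 1024 × 1024 bytes is an assumption; the benchmark's own implementation may differ.

```python
from dataclasses import dataclass


@dataclass
class SpeedMetrics:
    extraction_time_s: float  # wall-clock seconds for the whole run
    files_per_second: float   # throughput over the batch
    time_per_mb_s: float      # seconds per megabyte of input


def speed_metrics(elapsed_s: float, file_count: int, total_bytes: int) -> SpeedMetrics:
    """Derive the three speed metrics from raw measurements (hypothetical helper)."""
    if elapsed_s <= 0 or total_bytes <= 0:
        raise ValueError("elapsed_s and total_bytes must be positive")
    total_mb = total_bytes / (1024 * 1024)  # assumption: 1 MB = 1024 * 1024 bytes
    return SpeedMetrics(
        extraction_time_s=elapsed_s,
        files_per_second=file_count / elapsed_s,
        time_per_mb_s=elapsed_s / total_mb,
    )
```

In practice `elapsed_s` would typically come from two `time.perf_counter()` readings taken around the extraction call.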

Memory Metrics

  • Peak Memory (RSS): Maximum resident set size during extraction
  • Average Memory: Mean memory usage throughout extraction
  • Memory per MB: Memory usage normalized by file size
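
The memory metrics can be summarized from periodic RSS samples. The sketch below assumes the samples are already collected as byte counts and that Memory per MB is based on peak memory; both are assumptions, and `memory_metrics` is a hypothetical name.

```python
from statistics import mean

MB = 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes


def memory_metrics(rss_samples_bytes: list[int], file_size_bytes: int) -> dict[str, float]:
    """Summarize RSS samples taken during one extraction (hypothetical helper)."""
    if not rss_samples_bytes or file_size_bytes <= 0:
        raise ValueError("need at least one sample and a positive file size")
    peak_mb = max(rss_samples_bytes) / MB
    average_mb = mean(rss_samples_bytes) / MB
    return {
        "peak_memory_mb": peak_mb,
        "average_memory_mb": average_mb,
        # Assumption: normalization uses peak memory, not the average.
        "memory_per_mb": peak_mb / (file_size_bytes / MB),
    }
```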

CPU Metrics

  • Peak CPU%: Maximum CPU utilization during extraction
  • Average CPU%: Mean CPU utilization throughout extraction

Quality Metrics

Text Quality

  • Character Count: Total characters extracted
  • Word Count: Total words extracted
  • Completeness Score: Percentage of content successfully extracted
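
The counts above are simple to compute; the Completeness Score needs a reference to compare against. The sketch below assumes a ground-truth text is available and uses a word-count ratio capped at 100. That formula is an assumption made for illustration, not the documented definition, and `text_quality` is a hypothetical name.

```python
def text_quality(extracted: str, reference: str) -> dict[str, float]:
    """Compute basic text-quality metrics (hypothetical helper)."""
    char_count = len(extracted)
    word_count = len(extracted.split())  # whitespace-delimited words
    reference_words = len(reference.split())
    # Assumption: completeness is the extracted/reference word ratio, capped at 100%.
    if reference_words:
        completeness = min(100.0, 100.0 * word_count / reference_words)
    else:
        completeness = 0.0
    return {
        "character_count": char_count,
        "word_count": word_count,
        "completeness_score": completeness,
    }
```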

Accuracy Metrics

  • Quality Score: Overall quality assessment (0-100)
  • Format Preservation: How well formatting is maintained
  • Metadata Extraction: Success in extracting document metadata

Reliability Metrics

Success Metrics

  • Success Rate: Percentage of successful extractions
  • Partial Success Rate: Percentage of files with only partial content extracted
  • Timeout Rate: Percentage of files that exceeded the time limit

Error Metrics

  • Failure Rate: Percentage of complete failures
  • Error Categories: Classification of failure types
  • Recovery Rate: Ability to extract partial content on error
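
The percentage-based rates in the Success and Error lists can be sketched as counts over per-file outcomes. The outcome labels and the function name `reliability_rates` are hypothetical, and the sketch assumes each file gets exactly one outcome; the source does not say whether a timeout also counts as a failure.

```python
from collections import Counter

# Hypothetical outcome labels; assumed to be mutually exclusive per file.
OUTCOMES = ("success", "partial", "timeout", "failure")


def reliability_rates(outcomes: list[str]) -> dict[str, float]:
    """Return each outcome's share of the batch as a percentage."""
    if not outcomes:
        raise ValueError("outcomes must not be empty")
    counts = Counter(outcomes)
    unknown = set(counts) - set(OUTCOMES)
    if unknown:
        raise ValueError(f"unknown outcome labels: {sorted(unknown)}")
    total = len(outcomes)
    return {f"{name}_rate": 100.0 * counts[name] / total for name in OUTCOMES}
```

A `Counter` over error-type strings would give the Error Categories breakdown in the same way.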

Composite Scores

Overall Score

Calculated as a weighted average:

  • Speed: 30%
  • Memory Efficiency: 20%
  • Quality: 30%
  • Reliability: 20%
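
The weighting above can be sketched as follows. The sketch assumes each component has already been normalized to a 0-100 score; how raw speed or memory measurements are mapped onto that scale is not specified here. `WEIGHTS` and `overall_score` are hypothetical names.

```python
# Weights as listed above; they sum to 1.0.
WEIGHTS = {
    "speed": 0.30,
    "memory_efficiency": 0.20,
    "quality": 0.30,
    "reliability": 0.20,
}


def overall_score(components: dict[str, float]) -> float:
    """Weighted average of component scores, each assumed to be on a 0-100 scale."""
    missing = WEIGHTS.keys() - components.keys()
    if missing:
        raise ValueError(f"missing component scores: {sorted(missing)}")
    return sum(WEIGHTS[name] * components[name] for name in WEIGHTS)
```

For example, component scores of 80 (speed), 70 (memory efficiency), 90 (quality), and 100 (reliability) give 24 + 14 + 27 + 20 = 85, up to floating-point rounding.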

Grade Calculation

  • A+: 95-100
  • A: 90-94
  • B+: 85-89
  • B: 80-84
  • C+: 75-79
  • C: 70-74
  • D: 60-69
  • F: <60
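
The bands above are written for whole numbers, so a fractional score such as 94.5 falls between two of them. The sketch below resolves this by treating each band's lower bound as a threshold; that choice is an assumption, and `GRADE_FLOORS` and `grade` are hypothetical names.

```python
# Lower bound of each band, highest first.
GRADE_FLOORS = [
    (95, "A+"),
    (90, "A"),
    (85, "B+"),
    (80, "B"),
    (75, "C+"),
    (70, "C"),
    (60, "D"),
]


def grade(score: float) -> str:
    """Map a 0-100 overall score to a letter grade."""
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    for floor, letter in GRADE_FLOORS:
        if score >= floor:
            return letter
    return "F"  # anything below 60
```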