Evaluating Text Extraction: Apache Tika's New Tika-Eval Module

Evaluating Text Extraction: Apache Tika's New Tika-Eval Module

Linux Foundation via YouTube Direct link

Regression Testing

8 of 33

8 of 33

Regression Testing

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Evaluating Text Extraction: Apache Tika's New Tika-Eval Module

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction
  2. 2 Overview
  3. 3 Whats different
  4. 4 Content Extraction
  5. 5 Metadata
  6. 6 Blood on the Highway
  7. 7 Search
  8. 8 Regression Testing
  9. 9 What Can Go Wrong
  10. 10 Hidden Problems
  11. 11 Example of Missing Text
  12. 12 Dream
  13. 13 Evaluation Metric
  14. 14 TikaEval Overview
  15. 15 TikaEval Definitions
  16. 16 Why TikaEval
  17. 17 TikaEval
  18. 18 Profile
  19. 19 Compare
  20. 20 StartDB
  21. 21 Profile Reports
  22. 22 Common Words Metric
  23. 23 Similarity Metric
  24. 24 Common Word Metric
  25. 25 Evaluation Metric Public
  26. 26 Limitations
  27. 27 Human Interpretation
  28. 28 Conclusion
  29. 29 Resources
  30. 30 Thank you
  31. 31 Data import handler
  32. 32 Metadata normalization
  33. 33 Application dependent

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.