Interesting…
Magic was what I was looking for in the absence of documentation to suggest these limitations. Image files have structured data elements that might be detectable and aggregatable. Even the title could suggest a grouping. Considerations for the future…
I am converting the pdfs with OCR and then will test on text-based files