After removing everything that isn’t documents or files, etc. I now have these three databases. Quick inspection says that the duplicates have a lot of false positives, but I’ll need to clear out the actual duplicates before I can be sure of the numbers.
3893 duplicates
30 duplicates
1292 duplicates