Quantcast
Channel: DEVONtechnologies Community - Latest posts
Viewing all articles
Browse latest Browse all 16117

DT finds string in an img's data-url – intentionally?

$
0
0

If I search for the string “isbn”, DT returns three documents, two of which contain the string (one MD, one PDF).

The third document is a formatted note consisting of an img element with the image in a data-URL. And this data does contain the string, like so:
BYvgWeymUbWAMIKQbwWgSCUDQPA8DQQgEUgEBlMYpVgE9ikogEA~~isBn~~sAVEI0oCAWyKICohkE5GC8Cw
(I added ~~ around it to make it easier recognizable).

Does it really make sense to return matches from serialized image data?

In my opinion, it would be more sensible to index only the text nodes of a formatted note’s DOM.


Viewing all articles
Browse latest Browse all 16117

Trending Articles