switch to html2text() instead of strip_tags() when preparing FTS index

Andrew Dolgov
2023-10-21 10:51:24 +03:00
parent 2b61052e87
commit 03e956132d
73 changed files with 27833 additions and 17 deletions
@@ -0,0 +1,50 @@
<html>
<body>
<div>
Hello
<br>
</div>
<div>
How are you?
<br>
</div>
<p>
How are you?
<br>
</p>
<p>
How are you?
<br>
</p>
<div>
Just two divs
</div>
<div>
Hanging out
</div>
This is not the end!
<div>
How are you again?
<br>
</div>
This is the end!
<br>
Just kidding
<h1>Header 1</h1>
Some text
<hr>
Some more text
<p>Paragraph tag!</p>
<h2>Header 2</h2>
<hr>
<h3>Header 3</h3>
Some text
<h4>Header 4</h4>
<p>Paragraph tag!</p>
Final line
</body>
</html>
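
Note on the fixture above: it mixes adjacent block elements (`<div>`, `<p>`, headers, `<hr>`) with bare text and `<br>` breaks. The commit message does not state why html2text() replaces strip_tags(); one plausible reason, assumed here, is that plain tag stripping fuses words across block boundaries, which hurts tokenization for a full-text index. The sketch below illustrates that difference in Python using only the standard library. It is not the PHP code from this commit: `naive_strip`, `block_aware_text`, and the `BLOCK_TAGS` set are hypothetical names chosen for illustration.

```python
import re
from html.parser import HTMLParser

# Tags treated as block-level in this sketch (an assumption, not a full list).
BLOCK_TAGS = {"div", "p", "br", "hr", "h1", "h2", "h3", "h4"}


def naive_strip(html: str) -> str:
    """Drop anything that looks like a tag, inserting nothing in its place."""
    return re.sub(r"<[^>]*>", "", html)


class BlockAwareText(HTMLParser):
    """Collect text and emit a newline at every block-level tag boundary."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        self.parts.append(data)


def block_aware_text(html: str) -> str:
    """Return the text content with one non-empty line per block of text."""
    parser = BlockAwareText()
    parser.feed(html)
    parser.close()
    lines = (line.strip() for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)


# Input modeled on the "Just two divs" part of the fixture.
sample = "<div>Just two divs</div><div>Hanging out</div>This is not the end!"
print(naive_strip(sample))       # words on either side of a tag run together
print(block_aware_text(sample))  # each block lands on its own line
```

With the naive approach, "divs" and "Hanging" end up as a single token, while the block-aware pass keeps them on separate lines. A real converter handles far more cases (links, lists, entities, whitespace rules) than this sketch does.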