You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Not able to identify whether the input document has & or & in the text node, since Jsoup escapes the character in text node. Same goes to other entities like </<.
This does not provide any control to the Jsoup users where they can take any action based on input. For example; If we want to remove < character in text node but preserve when given as entity <
Note: Please let me know if there is already a way to differentiate this.
Providing an option where I could input Jsoup to not modify the text node will be super helpful. This provides more flexibility and control to the customers.
Muthukirthan
changed the title
Jsoup - Not able to identify escaped/unescaped html entity in the text nodes 💭
💭 Jsoup - Not able to identify escaped/unescaped html entity in the text nodes
Oct 6, 2024
Not able to identify whether the input document has
&
or&
in the text node, since Jsoup escapes the character in text node. Same goes to other entities like<
/<
.This does not provide any control to the Jsoup users where they can take any action based on input. For example; If we want to remove
<
character in text node but preserve when given as entity<
Note: Please let me know if there is already a way to differentiate this.
Providing an option where I could input Jsoup to not modify the text node will be super helpful. This provides more flexibility and control to the customers.
@jhy
The text was updated successfully, but these errors were encountered: