We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comparing different libraries for parsing HTML and found that Skyscraper fails in some cases when other (sxd_html or one on Swift) works fine.
let link = "https://livejournal.com/"; let response = reqwest::blocking::get(link).expect("load url error"); let html_text = response.text().expect("get html text"); let document = skyscraper::html::parse(&html_text).expect("parse html");
returns parse html: EndTagMismatch { end_name: "svg", open_name: "symbol" }
parse html: EndTagMismatch { end_name: "svg", open_name: "symbol" }
The text was updated successfully, but these errors were encountered:
I'm currently working on a rewrite of the HTML module. It will follow the official HTML standard as defined by https://html.spec.whatwg.org/multipage/parsing.html. Hopefully that will solve your issues.
It's a lot of work though so I don't really have an ETA - depends how much free time I get.
Sorry, something went wrong.
No branches or pull requests
Comparing different libraries for parsing HTML and found that Skyscraper fails in some cases when other (sxd_html or one on Swift) works fine.
returns
parse html: EndTagMismatch { end_name: "svg", open_name: "symbol" }
The text was updated successfully, but these errors were encountered: