The example here has HTML with a somewhat fixed format. It would indeed have been better to use samples in different formats and to aim for a low error rate.
If you are scraping a limited number of sites, you could ask the LLM, for each site, to write parsing code from a few samples, review that code, and move on.
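
As a rough sketch of that per-site approach: keep one reviewed parser function per hostname and dispatch on the URL. Everything named here (`parse_example_shop`, `PARSERS`, `parse_page`, the host `shop.example.com`, and the idea that the title lives in an `<h1>`) is made up for illustration; the parser body stands in for whatever code the LLM produced and you reviewed.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class _TitleParser(HTMLParser):
    """Collects the text inside the first <h1> element."""

    def __init__(self):
        super().__init__()
        self._in_h1 = False
        self.title = None

    def handle_starttag(self, tag, attrs):
        # Only the first <h1> is recorded; later ones are ignored.
        if tag == "h1" and self.title is None:
            self._in_h1 = True
            self.title = ""

    def handle_endtag(self, tag):
        if tag == "h1":
            self._in_h1 = False

    def handle_data(self, data):
        if self._in_h1:
            self.title += data


def parse_example_shop(html):
    # Hypothetical stand-in for LLM-generated, human-reviewed parsing code.
    parser = _TitleParser()
    parser.feed(html)
    return {"title": parser.title.strip() if parser.title else None}


# One reviewed parser per site (hypothetical hostname).
PARSERS = {"shop.example.com": parse_example_shop}


def parse_page(url, html):
    """Dispatch to the reviewed parser for the URL's host."""
    host = urlparse(url).hostname
    parser = PARSERS.get(host)
    if parser is None:
        # Fail loudly so an unreviewed site never gets parsed silently.
        raise LookupError(f"no reviewed parser for {host}")
    return parser(html)
```

Once a parser is reviewed, no LLM call is needed at scrape time, and a site without a parser raises an error instead of producing guessed output.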