Gupta, Kaiser, Neistadt, Grimm, 2003. DOM-based content extraction of HTML documents. ACM Press. https://doi.org/10.1145/775152.775182