How to (code-)efficiently traverse a DOM? #41

therealprof · 2017-02-27T13:31:03Z

I do have a rather simple DOM which I'd like to traverse but the regular DOM implementation makes it rather tedious to actually navigate around in the parsed tree. To get to the root element of the document I'm using this unsightly code at the moment (there may or may not be a comment before the root element so I have to filter that):

    let root = doc.root()
        .children()
        .into_iter()
        .find(|&x|
            if let dom::ChildOfRoot::Element(_) = x {
                true
            } else {
                false
            }
        )
        .unwrap()
        .element()
        .unwrap();

The next level (of interest) is a <model> which I'm getting at like:

    let model = root.children()
        .into_iter()
        .find(|&x| {
            if let Some(name) = x.element() {
                name.name().local_part() == "model"
            } else {
                false
            }
        })
        .unwrap()
        .element()
        .unwrap();

and so on and so on.

It seems tinydom would provide more convenient access to the DOM but that looks unfinished and under-documented at the moment.

Is there a more elegant way to traverse the DOM, like a direct iterator over all children as elements directly so I can skip all the naughty element-ification and unwrapping?

The text was updated successfully, but these errors were encountered:

shepmaster · 2017-02-27T20:31:46Z

It sounds like you want an XPath:

extern crate sxd_xpath;

use sxd_xpath::{evaluate_xpath, Value};

fn main() {
    let value = evaluate_xpath(&doc, "/*/model").expect("XPath evaluation failed");
    if let Value::Nodeset(nodes) {
        // do something with the nodes.
    }
}

tinydom would provide more convenient access to the DOM

That's actually an experimental interface that should have the same capabilities as the traditional DOM interface but provide different compile-time tradeoffs. It's "underdocumented" in the sense that it's deliberately hidden from the docs 😉

shepmaster · 2017-02-27T20:33:21Z

You'll also note that / and /* correspond to the Root type and the single Element child of the Root type that we are discussing in #40, which is part of the reason they are different types.

therealprof · 2017-02-28T07:49:29Z

I'm not sure using XPath is a key to success here. While it can be used to get to the subdocuments I need, the parsing of those subdocuments is going to be as onerous as it is getting to them right now. I see XPath more as a tool to search for or extract partial information from a document but I really need to consume all of it.

shepmaster added the question label Feb 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to (code-)efficiently traverse a DOM? #41

How to (code-)efficiently traverse a DOM? #41

therealprof commented Feb 27, 2017

shepmaster commented Feb 27, 2017

shepmaster commented Feb 27, 2017 •

edited

Loading

therealprof commented Feb 28, 2017

How to (code-)efficiently traverse a DOM? #41

How to (code-)efficiently traverse a DOM? #41

Comments

therealprof commented Feb 27, 2017

shepmaster commented Feb 27, 2017

shepmaster commented Feb 27, 2017 • edited Loading

therealprof commented Feb 28, 2017

shepmaster commented Feb 27, 2017 •

edited

Loading