Is it possible to get text after html tag? #355

sooxt98 · 2020-10-23T07:11:17Z

<span></span>1
<span></span>2
<span></span>3

i want to get 1,2,3 out;

i tried with doc.Contents().Each it just return the whole text out at once

The text was updated successfully, but these errors were encountered:

mna · 2020-10-23T16:05:37Z

Not directly with a selector, but see #287 .

sooxt98 · 2020-10-23T16:08:07Z

@mna i think goquery cant separate that example code into 6 chucks, it just return one big whole chunk with all the text inside ; so .Contents().Each is useless for me

mna · 2020-10-23T16:13:38Z

What do you mean? Did you try the example in the issue I linked?

const data = `
<div>
<span></span>1
<span></span>2
<span></span>3
</div>
`

func main() {
	doc, err := goquery.NewDocumentFromReader(strings.NewReader(data))
	if err != nil {
		log.Fatal(err)
	}
	doc.Find("div").Contents().Each(func(i int, s *goquery.Selection) {
		if goquery.NodeName(s) == "#text" {
			fmt.Printf(">>> (%d) >>> %s\n", i, s.Text())
		}
	})
}

// Prints:
// >>> (0) >>> 
// >>> (2) >>> 1
// >>> (4) >>> 2
// >>> (6) >>> 3

sooxt98 · 2020-10-23T16:17:28Z

@mna please try not to wrap with parent div

mna · 2020-10-23T16:19:48Z

Just change the "div" selector (which obviously won't work) with "body".

sooxt98 · 2020-10-23T16:22:45Z

Okay thanks, I think i might manually add the tag around it

mna · 2020-10-23T16:25:39Z

Oh you don't have to, if all you have is the three spans, when parsed with the net/html parser, it will automatically add the html/head/body tags to make it a proper document (the Go html parser uses the same logic as the official html5 parser unsed in browsers, so it tries hard to "fix" documents to make them valid). When in doubt, you should print the full html document after the call to goquery.NewDocument... (using goquery.OuterHtml(doc.Selection)).

sooxt98 closed this as completed Oct 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to get text after html tag? #355

Is it possible to get text after html tag? #355

sooxt98 commented Oct 23, 2020 •

edited

Loading

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

Is it possible to get text after html tag? #355

Is it possible to get text after html tag? #355

Comments

sooxt98 commented Oct 23, 2020 • edited Loading

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020

mna commented Oct 23, 2020

sooxt98 commented Oct 23, 2020 •

edited

Loading