html_attr()
gets a single attribute; html_attrs()
gets all attributes.
html_attr(x, name, default = NA_character_) html_attrs(x)
x | A document (from |
---|---|
name | Name of attribute to retrieve. |
default | A string used as a default value when the attribute does not exist in every element. |
A character vector (for html_attr()
) or list (html_attrs()
)
the same length as x
.
html <- minimal_html('<ul> <li><a href="https://a.com" class="important">a</a></li> <li class="active"><a href="https://c.com">b</a></li> <li><a href="https://c.com">b</a></li> </ul>') html %>% html_elements("a") %>% html_attrs() #> [[1]] #> href class #> "https://a.com" "important" #> #> [[2]] #> href #> "https://c.com" #> #> [[3]] #> href #> "https://c.com" #> html %>% html_elements("a") %>% html_attr("href") #> [1] "https://a.com" "https://c.com" "https://c.com" html %>% html_elements("li") %>% html_attr("class") #> [1] NA "active" NA html %>% html_elements("li") %>% html_attr("class", default = "inactive") #> [1] "inactive" "active" "inactive"