Title: | Concise, Lazy and Reliable Wrapper for 'chromote' and 'selenium' |
---|---|
Description: | A user-friendly wrapper for web automation, using either 'chromote' or 'selenium'. Provides a simple and consistent API to make web scraping and testing scripts easy to write and understand. Elements are lazy, and automatically wait for the website to be valid, resulting in reliable and reproducible code, with no visible impact on the experience of the programmer. |
Authors: | Ashby Thorpe [aut, cre, cph] |
Maintainer: | Ashby Thorpe <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.4.1.9000 |
Built: | 2025-01-12 15:25:08 UTC |
Source: | https://github.com/ashbythorpe/selenider |
This function essentially turns x
into:
list(x[[1]], x[[2]], ...)
However, to do this, the length of x
must be computed. This means that
while each element inside the list is still lazy, the list itself cannot be
considered lazy, since the number of elements in the DOM may change. To
avoid problems, it is recommended to use an element list just after it is
created, to make sure the list is an accurate representation of the DOM
when it is being used.
## S3 method for class 'selenider_elements' as.list(x, timeout = NULL, ...)
## S3 method for class 'selenider_elements' as.list(x, timeout = NULL, ...)
x |
A |
timeout |
How long to wait for |
... |
Not used. |
Transform a selenider_elements
object into a list of
selenider_element
objects. The result can then be used in for loops and
higher order functions like lapply()
/purrr::map()
(whereas a
selenider_element
object cannot).
A list of selenider_element
objects.
elem_flatten()
to combine multiple
selenider_element
/selenider_elements
objects into a single object.
find_each_element()
and find_all_elements()
to select elements
using an element collection while preserving laziness.
html <- " <div id='div1'> <p>Text 1</p> </div> <div id='div2'> <p>Text 2</p> </div> <div id='div3'> <p>Text 3</p> </div> <div id='div4'> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) p_tags <- ss("p") for (elem in as.list(p_tags)) { print(elem_text(elem)) } p_tags |> as.list() |> lapply(elem_text)
html <- " <div id='div1'> <p>Text 1</p> </div> <div id='div2'> <p>Text 2</p> </div> <div id='div3'> <p>Text 3</p> </div> <div id='div4'> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) p_tags <- ss("p") for (elem in as.list(p_tags)) { print(elem_text(elem)) } p_tags |> as.list() |> lapply(elem_text)
back()
navigates to the previously opened URL, or the previously opened
page in your browsing history.
forward()
reverses the action of back()
, going to the next page in your
browsing history.
back(timeout = NULL, session = NULL) forward(timeout = NULL, session = NULL)
back(timeout = NULL, session = NULL) forward(timeout = NULL, session = NULL)
timeout |
The maximum time to wait for the page to load, in seconds. This defaults to 60, unless in a Github Action, in which case it defaults to 5 minutes. |
session |
A |
The session object, invisibly.
Other global actions:
current_url()
,
execute_js_fn()
,
get_page_source()
,
open_url()
,
reload()
,
scroll_to()
,
take_screenshot()
session <- selenider_session() open_url("https://r-project.org") open_url("https://www.tidyverse.org/") back() forward()
session <- selenider_session() open_url("https://r-project.org") open_url("https://www.tidyverse.org/") back() forward()
chromote_options()
and selenium_options()
return a list of options that
can be passed to the options
argument of selenider_session()
.
chromote_options()
allows you to control the creation of a chromote driver
created using chromote::ChromoteSession$new().
selenium_options()
allows you to control the creation of a selenium driver.
selenium_server_options()
should be passed to the server_options
argument of selenium_options()
, allowing you to control the creation of
the server using selenium::selenium_server()
.
selenium_client_options()
should be passed to the client_options
argument
of selenium_options()
, allowing you to control the creation of a Selenium
client created using
selenium::SeleniumSession$new().
chromote_options( headless = TRUE, parent = NULL, width = 992, height = 1323, targetId = NULL, wait_ = TRUE, auto_events = NULL ) selenium_options( client_options = selenium_client_options(), server_options = selenium_server_options() ) selenium_server_options( version = "latest", port = 4444L, selenium_manager = NULL, verbose = FALSE, temp = TRUE, path = NULL, interactive = FALSE, echo_cmd = FALSE, extra_args = c() ) selenium_client_options( port = 4444L, host = "localhost", verbose = FALSE, capabilities = NULL, request_body = NULL, timeout = 60 )
chromote_options( headless = TRUE, parent = NULL, width = 992, height = 1323, targetId = NULL, wait_ = TRUE, auto_events = NULL ) selenium_options( client_options = selenium_client_options(), server_options = selenium_server_options() ) selenium_server_options( version = "latest", port = 4444L, selenium_manager = NULL, verbose = FALSE, temp = TRUE, path = NULL, interactive = FALSE, echo_cmd = FALSE, extra_args = c() ) selenium_client_options( port = 4444L, host = "localhost", verbose = FALSE, capabilities = NULL, request_body = NULL, timeout = 60 )
headless |
Whether to run the browser in headless mode, meaning
that you won't actually be able to see the browser as you control it.
For debugging purposes and interactive use, it is often useful to set
this to |
parent |
The parent chromote session. |
width , height , targetId , wait_ , auto_events
|
Passed into chromote::ChromoteSession$new(). |
client_options |
A |
server_options |
A |
version |
The version of Selenium server to use. |
port |
The port number to use. |
selenium_manager , verbose , temp , path , interactive , echo_cmd , extra_args
|
Passed into |
host , capabilities , request_body , timeout
|
Passed into selenium::SeleniumSession$new(). |
Shut down a session object, closing the browser and stopping the server. This will be done automatically if the session is set as the local session (which happens by default).
close_session(x = NULL)
close_session(x = NULL)
x |
A |
Nothing.
session <- selenider_session(local = FALSE) close_session(session)
session <- selenider_session(local = FALSE) close_session(session)
These functions are deprecated and will be removed in a future release.
Use the options
argument to selenider_session()
instead. If you want
to manually create a chromote or selenium session, use
chromote::ChromoteSession, selenium::SeleniumSession and
selenium::selenium_server()
manually, since these functions
are only a thin wrapper around them.
create_chromote_session(parent = NULL, ...) create_selenium_server( browser, version = "latest", driver_version = "latest", port = 4444L, quiet = TRUE, selenium_manager = TRUE, ... ) create_selenium_client(browser, port = 4444L, host = "localhost", ...) create_rselenium_client(browser, port = 4444L, ...)
create_chromote_session(parent = NULL, ...) create_selenium_server( browser, version = "latest", driver_version = "latest", port = 4444L, quiet = TRUE, selenium_manager = TRUE, ... ) create_selenium_client(browser, port = 4444L, host = "localhost", ...) create_rselenium_client(browser, port = 4444L, ...)
parent , ... , version , driver_version , port , quiet , host
|
See the
documentation for |
browser |
The browser to use. |
selenium_manager |
If this is |
create_chromote_session()
returns a chromote::ChromoteSession object.
create_selenium_server()
returns a processx::process or wdman
equivalent.
create_selenium_client()
returns a selenium::SeleniumSession object.
create_rselenium_client()
returns an RSelenium::remoteDriver object.
Get the full URL of the current page.
current_url(session = NULL)
current_url(session = NULL)
session |
Optionally, a |
A string: the current URL.
Other global actions:
back()
,
execute_js_fn()
,
get_page_source()
,
open_url()
,
reload()
,
scroll_to()
,
take_screenshot()
session <- selenider_session() open_url("https://r-project.org") current_url()
session <- selenider_session() open_url("https://r-project.org") current_url()
Find all elements with a certain relative position to an HTML element.
elem_ancestors()
selects every element which contains the current element
(children, grand-children, etc.).
elem_parent()
selects the element that contains the current element.
elem_siblings()
selects every element which has the same parent as the
current element.
elem_children()
selects every element which is connected to and directly
below the current element.
elem_descendants()
selects every element that is contained by the current
element. The current element does not have to be a direct parent, but must
be some type of ancestor.
elem_ancestors(x) elem_parent(x) elem_siblings(x) elem_children(x) elem_descendants(x)
elem_ancestors(x) elem_parent(x) elem_siblings(x) elem_children(x) elem_descendants(x)
x |
A |
All functions except elem_children()
and elem_descendants()
use XPath
selectors, so may be slow, especially when using chromote
as a backend.
All functions return a selenider_elements
object, except
elem_parent()
, which returns a selenider_element
object (since an
element can only have one parent).
http://web.simmons.edu/~grovesd/comm244/notes/week4/document-tree for a simple and visual explanation of the document tree.
find_element()
and find_elements()
for other ways of selecting
elements. These functions allow you to select ancestors using one or more
conditions (e.g. CSS selectors).
elem_filter()
and elem_find()
for filtering element collections.
html <- " <html> <body> <div> <div id='current'> <p></p> <div> <p></p> <br> </div> </div> <div></div> <p></p> </div> </body> </html> " session <- minimal_selenider_session(html) current <- s("#current") # Get all the names of an element collection elem_names <- function(x) { x |> as.list() |> vapply(elem_name, FUN.VALUE = character(1)) } current |> elem_ancestors() |> elem_expect(has_length(3)) |> elem_names() # html, div, body current |> elem_parent() |> elem_name() # div current |> elem_siblings() |> elem_expect(has_length(2)) |> elem_names() # div, p current |> elem_children() |> elem_expect(has_length(2)) |> elem_names() # p, div current |> elem_descendants() |> elem_expect(has_length(4)) |> elem_names() # p, div, p, br
html <- " <html> <body> <div> <div id='current'> <p></p> <div> <p></p> <br> </div> </div> <div></div> <p></p> </div> </body> </html> " session <- minimal_selenider_session(html) current <- s("#current") # Get all the names of an element collection elem_names <- function(x) { x |> as.list() |> vapply(elem_name, FUN.VALUE = character(1)) } current |> elem_ancestors() |> elem_expect(has_length(3)) |> elem_names() # html, div, body current |> elem_parent() |> elem_name() # div current |> elem_siblings() |> elem_expect(has_length(2)) |> elem_names() # div, p current |> elem_children() |> elem_expect(has_length(2)) |> elem_names() # p, div current |> elem_descendants() |> elem_expect(has_length(4)) |> elem_names() # p, div, p, br
Get an attribute of a selenider_element
object.
elem_attr()
returns a single attribute value as a string.
elem_attrs()
returns a named list containing every attribute.
elem_value()
returns the 'value' attribute.
elem_attr(x, name, default = NULL, timeout = NULL) elem_attrs(x, timeout = NULL) elem_value(x, ptype = character(), timeout = NULL)
elem_attr(x, name, default = NULL, timeout = NULL) elem_attrs(x, timeout = NULL) elem_value(x, ptype = character(), timeout = NULL)
x |
A |
name |
The name of the attribute to get; a string. |
default |
The default value to use if the attribute does not exist in the element. |
timeout |
The time to wait for |
ptype |
The type to cast the value to. Useful when the value is an integer or decimal number. By default, the value is returned as a string. |
elem_attr()
returns a character vector of length 1. elem_attrs()
returns a named list of strings. The return value of elem_value()
has the
same type as ptype
and length 1.
Other properties:
elem_css_property()
,
elem_name()
,
elem_size()
,
elem_text()
html <- " <a class='link' href='https://r-project.org'>R</a> <input type='number' value='0'> " session <- minimal_selenider_session(html) s("a") |> elem_attr("href") s("a") |> elem_attrs() s("input[type='number']") |> elem_value(ptype = integer())
html <- " <a class='link' href='https://r-project.org'>R</a> <input type='number' value='0'> " session <- minimal_selenider_session(html) s("a") |> elem_attr("href") s("a") |> elem_attrs() s("input[type='number']") |> elem_value(ptype = integer())
selenider_element
/selenider_elements
objects are generally
lazy, meaning they only collect the actual element in the DOM
when absolutely necessary, and forget it immediately after. This
is to avoid situations where the DOM changes after an element
has been collected, resulting in errors and unreliable behaviour.
elem_cache()
forces an element or collection of elements to be collected
and stored, making it eager rather than lazy. This is useful when you are
operating on the same element multiple times, since only collecting the
element once will improve performance. However, you must be sure that the
element will not change on the page while you are using it.
elem_cache(x, timeout = NULL)
elem_cache(x, timeout = NULL)
x |
A |
timeout |
How long to wait for the element(s) to exist while collecting them. |
These functions do not make selenider elements permanently eager. Further sub-elements will not be cached unless specified.
For example, consider the following code:
s(".class1") |> elem_parent() |> elem_cache() |> find_element(".class2")
In this example, the parent of the element with class ".class1" will be cached, but the child element with class ".class2" will not.
A modified version of x
. The result of elem_cache()
can be used
as a normal selenider_element
/selenider_elements
object.
find_element()
and find_elements()
to select elements.
as.list.selenider_elements()
, find_each_element()
and
find_all_elements()
if you want to iterate over an element collection.
html <- " <div> <p id='specifictext'></p> <button></button> </div> " session <- minimal_selenider_session(html) # Selecting this button may be slow, since we are using relative XPath # selectors. button <- s("#specifictext") |> elem_siblings() |> elem_find(has_name("button")) # But we need to click the button 10 times! # Normally, this would involve fetching the button from the DOM 10 times click_button_10_times <- function(x) { lapply(1:10, \(unnused) elem_click(x)) invisible(NULL) } # But with elem_cache(), the button will only be fetched once cached_button <- elem_cache(button) click_button_10_times(cached_button) # But the cached button is less reliable if the DOM is changing execute_js_fn("x => { x.outerHTML = '<button></button>'; }", button) try(elem_click(cached_button, timeout = 0.1)) # But the non-cached version works elem_click(button)
html <- " <div> <p id='specifictext'></p> <button></button> </div> " session <- minimal_selenider_session(html) # Selecting this button may be slow, since we are using relative XPath # selectors. button <- s("#specifictext") |> elem_siblings() |> elem_find(has_name("button")) # But we need to click the button 10 times! # Normally, this would involve fetching the button from the DOM 10 times click_button_10_times <- function(x) { lapply(1:10, \(unnused) elem_click(x)) invisible(NULL) } # But with elem_cache(), the button will only be fetched once cached_button <- elem_cache(button) click_button_10_times(cached_button) # But the cached button is less reliable if the DOM is changing execute_js_fn("x => { x.outerHTML = '<button></button>'; }", button) try(elem_click(cached_button, timeout = 0.1)) # But the non-cached version works elem_click(button)
Clicks on an HTML element, either by simulating a mouse click or by triggering the element's "click" event.
elem_click()
left clicks on the element, elem_double_click()
left clicks
on the element two times in a short period of time, while
elem_right_click()
right clicks on an element, opening its context menu.
elem_click(x, js = FALSE, timeout = NULL) elem_double_click(x, js = FALSE, timeout = NULL) elem_right_click(x, js = FALSE, timeout = NULL)
elem_click(x, js = FALSE, timeout = NULL) elem_double_click(x, js = FALSE, timeout = NULL) elem_right_click(x, js = FALSE, timeout = NULL)
x |
A |
js |
Whether to click the element using JavaScript. |
timeout |
How long to wait for the element to exist. |
x
, invisibly.
Other actions:
elem_hover()
,
elem_scroll_to()
,
elem_select()
,
elem_set_value()
,
elem_submit()
html <- " <button onclick = hidetext() oncontextmenu = showtext()></button> <p id = 'texttohide'>Hello!</p> " js <- " function hidetext() { document.getElementById('texttohide').style.display = 'none' } function showtext() { document.getElementById('texttohide').style.display = 'block' } " session <- minimal_selenider_session(html, js = js) elem_expect(s("p"), is_visible) s("button") |> elem_click() elem_expect(s("p"), is_invisible) s("button") |> elem_right_click() elem_expect(s("p"), is_visible)
html <- " <button onclick = hidetext() oncontextmenu = showtext()></button> <p id = 'texttohide'>Hello!</p> " js <- " function hidetext() { document.getElementById('texttohide').style.display = 'none' } function showtext() { document.getElementById('texttohide').style.display = 'block' } " session <- minimal_selenider_session(html, js = js) elem_expect(s("p"), is_visible) s("button") |> elem_click() elem_expect(s("p"), is_invisible) s("button") |> elem_right_click() elem_expect(s("p"), is_visible)
Get a CSS property of an element (e.g. "background-color"
).
Specifically, the computed style is returned, meaning that,
for example, widths and heights will be returned in pixels, and
colours will be returned as an RGB value.
elem_css_property(x, name, timeout = NULL)
elem_css_property(x, name, timeout = NULL)
x |
A |
name |
The name of the CSS property to get. |
timeout |
The time to wait for |
A string, or NULL
if the property does not exist.
Other properties:
elem_attr()
,
elem_name()
,
elem_size()
,
elem_text()
html <- " <p style='visibility:hidden; color:red;'>Text</p> " session <- minimal_selenider_session(html) s("p") |> elem_css_property("visibility") s("p") |> elem_css_property("color")
html <- " <p style='visibility:hidden; color:red;'>Text</p> " session <- minimal_selenider_session(html) s("p") |> elem_css_property("visibility") s("p") |> elem_css_property("color")
Checks if two selenider_element
objects point to the
same element on the page. elem_equal()
is equivalent to
using ==
, but allows you to specify a timeout value if
needed.
elem_equal(x, y, timeout = NULL) ## S3 method for class 'selenider_element' e1 == e2
elem_equal(x, y, timeout = NULL) ## S3 method for class 'selenider_element' e1 == e2
x , y , e1 , e2
|
|
timeout |
How long to wait for the elements to be present. |
TRUE
or FALSE
.
elem_filter()
and elem_find()
for filtering collection of elements.
html <- " <div></div> <div class='second'> <p></p> </div> " session <- minimal_selenider_session(html) s("div") == ss("div")[[1]] has_p_child <- function(x) { x |> elem_children() |> # Direct children elem_filter(has_name("p")) |> has_at_least(1) } ss("div") |> elem_find(has_p_child) |> elem_equal(s(".second")) # TRUE
html <- " <div></div> <div class='second'> <p></p> </div> " session <- minimal_selenider_session(html) s("div") == ss("div")[[1]] has_p_child <- function(x) { x |> elem_children() |> # Direct children elem_filter(has_name("p")) |> has_at_least(1) } ss("div") |> elem_find(has_p_child) |> elem_equal(s(".second")) # TRUE
elem_expect()
waits for a set of conditions to return TRUE. If, after a
certain period of time (by default 4 seconds), this does not happen, an
informative error is thrown. Otherwise, the original element is returned.
elem_wait_until()
does the same, but returns a logical value (whether or
not the test passed), allowing you to handle the failure case explicitly.
elem_expect(x, ..., testthat = NULL, timeout = NULL) elem_wait_until(x, ..., timeout = NULL)
elem_expect(x, ..., testthat = NULL, timeout = NULL) elem_wait_until(x, ..., timeout = NULL)
x |
A |
... |
< |
testthat |
Whether to treat the expectation as a |
timeout |
The number of seconds to wait for a condition to pass. If not
specified, the timeout used for |
elem_expect()
invisibly returns the element(s) x
, or NULL
if an
element or collection of elements was not given in x
.
elem_wait_for()
returns a boolean flag: TRUE if the test passes, FALSE
otherwise.
Conditions can be supplied as functions or calls.
Functions allow you to use unary conditions without formatting them as a
call (e.g. is_present
rather than is_present()
). It also allows you to
make use of R's anonymous function syntax to quickly
create custom conditions. x
will be supplied as the first argument of this
function.
Function calls allow you to use conditions that take multiple arguments
(e.g. has_text()
) without the use of an intermediate function. The call
will be modified so that x
is the first argument to the function call. For
example, has_text("a")
will be modified to become: has_text(x, "a")
.
The and (&&
), or (||
) and not (!
) functions can be used on both types
of conditions. If more than one condition are given in ...
, they are
combined using &&
.
Any function which takes a selenider element or element collection as its first argument, and returns a logical value, can be used as a condition.
Additionally, these functions provide a few features that make creating custom conditions easy:
Errors with class expect_error_continue
are handled, and
the function is prevented from terminating early. This means that if
an element is not found, the function will retry instead of immediately
throwing an error.
selenider
functions used inside conditions have their timeout, by
default, set to 0, ignoring the local timeout. This is important, since
elem_expect()
and elem_wait_until()
implement a retry mechanic
manually. To override this default, manually specify a timeout.
These two features allow you to use functions like elem_text()
to access
properties of an element, without needing to worry about the errors that
they throw or the timeouts that they use. See Examples for a few examples of
a custom condition.
These custom conditions can also be used with elem_filter()
and
elem_find()
.
is_present()
and other conditions for predicates for HTML elements.
(If you scroll down to the See also section, you will find the rest).
elem_expect_all()
and elem_wait_until_all()
for an easy way to test a
single condition on multiple elements.
elem_filter()
and elem_find()
to use conditions to filter elements.
html <- " <div class='class1'> <button id='disabled-button' disabled>Disabled</button> <p>Example text</p> <button id='enabled-button'>Enabled</button> </div> <div class='class3'> </div> " session <- minimal_selenider_session(html) s(".class1") |> elem_expect(is_present) s("#enabled-button") |> elem_expect(is_visible, is_enabled) s("#disabled-button") |> elem_expect(is_disabled) # Error: element is visible but not enabled s("#disabled-button") |> elem_expect(is_visible, is_enabled, timeout = 0.5) |> try() # Since this condition will fail s(".class2") |> elem_expect(!is_present, !is_in_dom, is_absent) # All 3 are equivalent # All other conditions will error if the element does not exist s(".class2") |> elem_expect(is_invisible, timeout = 0.1) |> try() # elem_expect() returns the element, so can be used in chains s("#enabled-button") |> elem_expect(is_visible && is_enabled) |> elem_click() # Note that elem_click() will do this automatically s("p") |> elem_expect(is_visible, has_exact_text("Example text")) # Or use an anonymous function s("p") |> elem_expect(\(elem) identical(elem_text(elem), "Example text")) # If your conditions are not specific to an element, you can omit the `x` # argument elem_1 <- s(".class1") elem_2 <- s(".class2") elem_expect(is_present(elem_1) || is_present(elem_2)) # We can now use the conditions on their own to figure out which element # exists if (is_present(elem_1)) { print("Element 1 is visible") } else { print("Element 2 is visible") } # Use elem_wait_until() to handle failures manually elem <- s(".class2") if (elem_wait_until(elem, is_present)) { elem_click(elem) } else { reload() } # Creating a custom condition is easiest with an anonymous function s("p") |> elem_expect( \(elem) elem |> elem_text() |> grepl(pattern = "Example .*") ) # Or create a function, to reuse the condition multiple times text_contains <- function(x, pattern) { text <- elem_text(x) grepl(pattern, text) } s("p") |> elem_expect(text_contains("Example *")) # If we want to continue on error, we need to use the # "expect_error_continue" class. # This involves making a custom error object. error_condition <- function() { my_condition <- list(message = "Custom error!") class(my_condition) <- c("expect_error_continue", "error", "condition") stop(my_condition) } # This is much easier with rlang::abort() / cli::cli_abort(): error_condition_2 <- function() { rlang::abort("Custom error!", class = "expect_error_continue") } # This error will not be caught try(elem_expect(stop("Uncaught error!"))) # These will eventually throw an error, but will wait 0.5 seconds to do so. try(elem_expect(error_condition(), timeout = 0.5)) try(elem_expect(error_condition_2(), timeout = 0.5))
html <- " <div class='class1'> <button id='disabled-button' disabled>Disabled</button> <p>Example text</p> <button id='enabled-button'>Enabled</button> </div> <div class='class3'> </div> " session <- minimal_selenider_session(html) s(".class1") |> elem_expect(is_present) s("#enabled-button") |> elem_expect(is_visible, is_enabled) s("#disabled-button") |> elem_expect(is_disabled) # Error: element is visible but not enabled s("#disabled-button") |> elem_expect(is_visible, is_enabled, timeout = 0.5) |> try() # Since this condition will fail s(".class2") |> elem_expect(!is_present, !is_in_dom, is_absent) # All 3 are equivalent # All other conditions will error if the element does not exist s(".class2") |> elem_expect(is_invisible, timeout = 0.1) |> try() # elem_expect() returns the element, so can be used in chains s("#enabled-button") |> elem_expect(is_visible && is_enabled) |> elem_click() # Note that elem_click() will do this automatically s("p") |> elem_expect(is_visible, has_exact_text("Example text")) # Or use an anonymous function s("p") |> elem_expect(\(elem) identical(elem_text(elem), "Example text")) # If your conditions are not specific to an element, you can omit the `x` # argument elem_1 <- s(".class1") elem_2 <- s(".class2") elem_expect(is_present(elem_1) || is_present(elem_2)) # We can now use the conditions on their own to figure out which element # exists if (is_present(elem_1)) { print("Element 1 is visible") } else { print("Element 2 is visible") } # Use elem_wait_until() to handle failures manually elem <- s(".class2") if (elem_wait_until(elem, is_present)) { elem_click(elem) } else { reload() } # Creating a custom condition is easiest with an anonymous function s("p") |> elem_expect( \(elem) elem |> elem_text() |> grepl(pattern = "Example .*") ) # Or create a function, to reuse the condition multiple times text_contains <- function(x, pattern) { text <- elem_text(x) grepl(pattern, text) } s("p") |> elem_expect(text_contains("Example *")) # If we want to continue on error, we need to use the # "expect_error_continue" class. # This involves making a custom error object. error_condition <- function() { my_condition <- list(message = "Custom error!") class(my_condition) <- c("expect_error_continue", "error", "condition") stop(my_condition) } # This is much easier with rlang::abort() / cli::cli_abort(): error_condition_2 <- function() { rlang::abort("Custom error!", class = "expect_error_continue") } # This error will not be caught try(elem_expect(stop("Uncaught error!"))) # These will eventually throw an error, but will wait 0.5 seconds to do so. try(elem_expect(error_condition(), timeout = 0.5)) try(elem_expect(error_condition_2(), timeout = 0.5))
elem_expect_all()
and elem_wait_until_all()
are complements to
elem_expect()
and elem_wait_until()
that test conditions on
multiple elements in an element collection.
elem_expect_all(x, ..., testthat = NULL, timeout = NULL) elem_wait_until_all(x, ..., timeout = NULL)
elem_expect_all(x, ..., testthat = NULL, timeout = NULL) elem_wait_until_all(x, ..., timeout = NULL)
x |
A |
... |
< |
testthat |
Whether to treat the expectation as a |
timeout |
The number of seconds to wait for a condition to pass. If not
specified, the timeout used for |
If x
does not contain any elements, elem_expect_all()
and
elem_wait_until_all()
will succeed. You may want to first verify that
at least one element exists with has_at_least()
.
elem_expect_all()
and elem_wait_until_all()
can be thought of as
alternatives to the use of all(vapply(FUN.VALUE = logical(1)))
(or
purrr::every()
) within elem_expect()
and elem_wait_until()
.
For example, the following two expressions are equivalent (where x
is an
element collection).
elem_expect( x, \(element) all(vapply(as.list(element), is_present, logical(1))) ) elem_expect_all(x, is_present)
However, the second example will give a more detailed error message on failure.
elem_expect_all()
returns x
, invisibly.
elem_wait_until_all()
returns a boolean flag: TRUE if the test passes,
FALSE otherwise.
is_present()
and other conditions for predicates for HTML elements.
(If you scroll down to the See also section, you will find the rest).
html <- " <div id='div1'>Content 1</div> <div id='div2'>Content 2</div> <div id='div3' style='display:none;'>Content 3</div> <div id='div4'>Content 4</div> " session <- minimal_selenider_session(html) ss("div") |> elem_expect_all(is_visible, timeout = 0.1) |> try() ss("div")[-3] |> elem_expect_all(is_visible)
html <- " <div id='div1'>Content 1</div> <div id='div2'>Content 2</div> <div id='div3' style='display:none;'>Content 3</div> <div id='div4'>Content 4</div> " session <- minimal_selenider_session(html) ss("div") |> elem_expect_all(is_visible, timeout = 0.1) |> try() ss("div")[-3] |> elem_expect_all(is_visible)
Operators to extract a subset of elements, or a single element, from a selenider element collection.
elem_filter()
and elem_find()
allow you to use conditions to filter HTML
elements (see is_present()
and other conditions). elem_find()
returns
the first element that satisfies one or more conditions, while
elem_filter()
returns every element that satisfies these conditions.
[
and [[
with a numeric subscript can be used on an element collection
to filter the elements by position. [
returns a single element at a
specified location, while [[
returns a collection of the elements at more
than one position.
elem_filter(x, ...) elem_find(x, ...) ## S3 method for class 'selenider_elements' x[i] ## S3 method for class 'selenider_elements' x[[i]]
elem_filter(x, ...) elem_find(x, ...) ## S3 method for class 'selenider_elements' x[i] ## S3 method for class 'selenider_elements' x[[i]]
x |
A |
... |
< |
i |
A number (or for |
As with the find_element()
and find_elements()
functions, these
functions are lazy, meaning that the elements are not fetched and filtered
until they are needed.
Conditions can be functions or function calls (see elem_expect()
for more
details).
elem_filter()
and [
return a selenider_elements
object, since they can
result in multiple elements.
elem_find()
and [[
return a single selenider_element
object.
find_elements()
and ss()
to get elements to filter.
is_present()
and other conditions for predicates on HTML elements.
(If you scroll down to the See also section, you will find the rest).
html <- " <button disabled>Button 1</button> <button>Button 2</button> <p>Text</p> <div style='display:none;'></div> " session <- minimal_selenider_session(html) elements <- ss("*") # Gives the same result as s() elements[[1]] elements[1:3] elements[-2] elements |> elem_filter(is_visible) elements |> elem_find(is_visible) # The above is equivalent to: visible_elems <- elements |> elem_filter(is_visible) visible_elems[[1]] # In R >= 4.3.0, we can instead do: # ss(".class1") |> # elem_filter(is_visible) |> # _[[1]] ss("button") |> elem_filter(is_enabled)
html <- " <button disabled>Button 1</button> <button>Button 2</button> <p>Text</p> <div style='display:none;'></div> " session <- minimal_selenider_session(html) elements <- ss("*") # Gives the same result as s() elements[[1]] elements[1:3] elements[-2] elements |> elem_filter(is_visible) elements |> elem_find(is_visible) # The above is equivalent to: visible_elems <- elements |> elem_filter(is_visible) visible_elems[[1]] # In R >= 4.3.0, we can instead do: # ss(".class1") |> # elem_filter(is_visible) |> # _[[1]] ss("button") |> elem_filter(is_enabled)
elem_flatmap()
previously allowed you to apply a function to each
element in a collection in a lazy manner. This function is now deprecated,
as it did not work in all cases. Use find_each_element()
and
find_all_elements()
instead for the simple case where you want to
select the children of a collection.
element_list()
is a deprecated alias for as.list.selenider_elements()
.
elem_flatmap(x, .f, ...) element_list(x, timeout = NULL)
elem_flatmap(x, .f, ...) element_list(x, timeout = NULL)
x |
A |
.f |
A function that takes a |
... |
Passed into |
timeout |
How long to wait for |
elem_flatmap()
returns a selenider_elements
object.
element_list()
returns a list of selenider_element
objects.
Combine a set of selenider_element
/selenider_elements
objects
into a single selenider_elements
object, allowing you to
perform actions on them at once. c()
and elem_flatten()
do the same
thing, but elem_flatten()
works when given a list of
selenider_element
/selenider_elements
objects.
elem_flatten(...) ## S3 method for class 'selenider_element' c(...) ## S3 method for class 'selenider_elements' c(...)
elem_flatten(...) ## S3 method for class 'selenider_element' c(...) ## S3 method for class 'selenider_elements' c(...)
... |
< |
A selenider_elements
object.
as.list.selenider_elements()
to iterate over element collections.
html <- " <div id='id1'></div> <div class='.class2'></div> <button id='button1'>Click me!</button> <div class='button-container'> <button id='button2'>No, click me!</button> </div> " session <- minimal_selenider_session(html) button_1 <- s("#button1") button_2 <- s("#button2") buttons <- elem_flatten(button_1, button_2) buttons |> elem_expect_all(is_enabled) buttons |> as.list() |> lapply(elem_click) # Doesn't just have to be single elements first_2_divs <- ss("div")[1:2] elem_flatten(first_2_divs, button_2) |> length() # We would like to use multiple css selectors and combine the results selectors <- c( "#id1", # Will select 1 element "button", # Will select 2 elements "p" # Will select 0 elements ) lapply(selectors, ss) |> elem_flatten() |> length() # 3
html <- " <div id='id1'></div> <div class='.class2'></div> <button id='button1'>Click me!</button> <div class='button-container'> <button id='button2'>No, click me!</button> </div> " session <- minimal_selenider_session(html) button_1 <- s("#button1") button_2 <- s("#button2") buttons <- elem_flatten(button_1, button_2) buttons |> elem_expect_all(is_enabled) buttons |> as.list() |> lapply(elem_click) # Doesn't just have to be single elements first_2_divs <- ss("div")[1:2] elem_flatten(first_2_divs, button_2) |> length() # We would like to use multiple css selectors and combine the results selectors <- c( "#id1", # Will select 1 element "button", # Will select 2 elements "p" # Will select 0 elements ) lapply(selectors, ss) |> elem_flatten() |> length() # 3
elem_hover()
moves the mouse over to an HTML element and hovers over it,
without actually clicking or interacting with it.
elem_focus()
focuses an HTML element.
elem_hover(x, js = FALSE, timeout = NULL) elem_focus(x, timeout = NULL)
elem_hover(x, js = FALSE, timeout = NULL) elem_focus(x, timeout = NULL)
x |
A |
js |
Whether to hover over the element using JavaScript. |
timeout |
How long to wait for the element to exist. |
x
, invisibly.
Other actions:
elem_click()
,
elem_scroll_to()
,
elem_select()
,
elem_set_value()
,
elem_submit()
html <- " <button onmouseover = settext()></button> <p class = 'text'></p> " js <- " function settext() { const element = document.getElementsByClassName('text').item(0); element.innerHTML = 'Button hovered!'; } " session <- minimal_selenider_session(html, js = js) elem_expect(s(".text"), has_exact_text("")) s("button") |> elem_hover() elem_expect(s(".text"), has_text("Button hovered!")) s("button") |> elem_focus()
html <- " <button onmouseover = settext()></button> <p class = 'text'></p> " js <- " function settext() { const element = document.getElementsByClassName('text').item(0); element.innerHTML = 'Button hovered!'; } " session <- minimal_selenider_session(html, js = js) elem_expect(s(".text"), has_exact_text("")) s("button") |> elem_hover() elem_expect(s(".text"), has_text("Button hovered!")) s("button") |> elem_focus()
Get the tag name (e.g. "p"
for a <p>
tag) of a selenider_element
object.
elem_name(x, timeout = NULL)
elem_name(x, timeout = NULL)
x |
A |
timeout |
The time to wait for |
A string.
Other properties:
elem_attr()
,
elem_css_property()
,
elem_size()
,
elem_text()
html <- " <div class='mydiv'></div> " session <- minimal_selenider_session(html) s(".mydiv") |> elem_name()
html <- " <div class='mydiv'></div> " session <- minimal_selenider_session(html) s(".mydiv") |> elem_name()
Scrolls to an HTML element.
elem_scroll_to(x, js = FALSE, timeout = NULL)
elem_scroll_to(x, js = FALSE, timeout = NULL)
x |
A |
js |
Whether to scroll to the element using JavaScript. |
timeout |
How long to wait for the element to exist. |
x
, invisibly.
Other actions:
elem_click()
,
elem_hover()
,
elem_select()
,
elem_set_value()
,
elem_submit()
html <- " <div style = 'height:100%; min-height:100vh'></div> <button onclick='checkScrolled()'></button> <p>Scroll down to find me!</p> " js <- " function checkScrolled() { let element = document.getElementsByTagName('p').item(0); let rect = element.getBoundingClientRect(); // If paragraph is in view const height = window.innerHeight || document.documentElement.clientHeight; if (rect.bottom <= height) { element.innerText = 'You found me!'; } } " session <- minimal_selenider_session(html, js = js) s("p") |> elem_scroll_to() s("button") |> elem_click() elem_expect(s("p"), has_text("You found me!"))
html <- " <div style = 'height:100%; min-height:100vh'></div> <button onclick='checkScrolled()'></button> <p>Scroll down to find me!</p> " js <- " function checkScrolled() { let element = document.getElementsByTagName('p').item(0); let rect = element.getBoundingClientRect(); // If paragraph is in view const height = window.innerHeight || document.documentElement.clientHeight; if (rect.bottom <= height) { element.innerText = 'You found me!'; } } " session <- minimal_selenider_session(html, js = js) s("p") |> elem_scroll_to() s("button") |> elem_click() elem_expect(s("p"), has_text("You found me!"))
Select or deselect select
and option
elements.
elem_select( x, value = NULL, text = NULL, index = NULL, timeout = NULL, reset_other = TRUE )
elem_select( x, value = NULL, text = NULL, index = NULL, timeout = NULL, reset_other = TRUE )
x |
A |
value |
If |
text |
The text content of the option to select. This does not have to be a complete match, and multiple options can be selected. |
index |
A vector of indexes. The nth option elements will be selected. |
timeout |
How long to wait for the element to exist. |
reset_other |
If |
If no arguments apart from x
are supplied, and x
is a select
element,
all options will be deselected.
x
, invisibly.
Other actions:
elem_click()
,
elem_hover()
,
elem_scroll_to()
,
elem_set_value()
,
elem_submit()
html <- " <select multiple> <option value='a'>Option A.</option> <option value='b'>Option B.</option> <option value='c'>Option C.</option> </select> " session <- minimal_selenider_session(html) s("select") |> elem_select("a") s("select") |> elem_select(text = c("Option A.", "Option C.")) s("select") |> elem_select(index = 2, reset_other = FALSE) # Reset selection s("select") |> elem_select() s("select") |> elem_select("b")
html <- " <select multiple> <option value='a'>Option A.</option> <option value='b'>Option B.</option> <option value='c'>Option C.</option> </select> " session <- minimal_selenider_session(html) s("select") |> elem_select("a") s("select") |> elem_select(text = c("Option A.", "Option C.")) s("select") |> elem_select(index = 2, reset_other = FALSE) # Reset selection s("select") |> elem_select() s("select") |> elem_select("b")
elem_set_value()
sets the value of an HTML input element to a string.
elem_set_value(x, text, timeout = NULL) elem_send_keys(x, ..., modifiers = NULL, timeout = NULL) elem_clear_value(x, timeout = NULL)
elem_set_value(x, text, timeout = NULL) elem_send_keys(x, ..., modifiers = NULL, timeout = NULL) elem_clear_value(x, timeout = NULL)
x |
A |
text |
A string to set the value of the input element to. |
timeout |
How long to wait for the element to exist. |
... |
A set of inputs to send to |
modifiers |
A character vector; one or more of "shift", "ctrl"/"control", "alt", and "command"/meta". Note that when using chromote as a backend, these do not work on Mac OS. |
elem_send_keys()
sends a set of inputs to an element.
elem_clear_value()
sets the value of an HTML element to ""
, removing any
existing content.
x
, invisibly.
Other actions:
elem_click()
,
elem_hover()
,
elem_scroll_to()
,
elem_select()
,
elem_submit()
html <- " <input type='text' oninput='recordChange(event)' onkeypress='return checkEnter(event);' /> <p></p> " js <- " function recordChange(e) { document.getElementsByTagName('p').item(0).innerText = e.target.value; } function checkEnter(e) { // If the key pressed was Enter if (e.keyCode == 13) { document.getElementsByTagName('p').item(0).innerText = 'Enter pressed!'; return false; } return true; } " session <- minimal_selenider_session(html, js = js) elem_expect(s("p"), has_exact_text("")) input <- s("input") elem_set_value(input, "my text") elem_expect(s("p"), has_text("my text")) elem_clear_value(input) elem_expect(s("p"), has_exact_text("")) elem_send_keys(input, keys$enter) elem_expect(s("p"), has_text("Enter pressed!"))
html <- " <input type='text' oninput='recordChange(event)' onkeypress='return checkEnter(event);' /> <p></p> " js <- " function recordChange(e) { document.getElementsByTagName('p').item(0).innerText = e.target.value; } function checkEnter(e) { // If the key pressed was Enter if (e.keyCode == 13) { document.getElementsByTagName('p').item(0).innerText = 'Enter pressed!'; return false; } return true; } " session <- minimal_selenider_session(html, js = js) elem_expect(s("p"), has_exact_text("")) input <- s("input") elem_set_value(input, "my text") elem_expect(s("p"), has_text("my text")) elem_clear_value(input) elem_expect(s("p"), has_exact_text("")) elem_send_keys(input, keys$enter) elem_expect(s("p"), has_text("Enter pressed!"))
Get the number of elements in a HTML element collection, waiting for the parent elements (if any) to exist before returning a value.
length()
and elem_size()
can be used interchangeably, the only
difference being that elem_size()
allows you to specify a timeout.
elem_size(x, timeout = NULL) ## S3 method for class 'selenider_elements' length(x)
elem_size(x, timeout = NULL) ## S3 method for class 'selenider_elements' length(x)
x |
A |
timeout |
The time to wait for the parent of |
An integer representing the number of elements in the collection.
Other properties:
elem_attr()
,
elem_css_property()
,
elem_name()
,
elem_text()
html <- " <div></div> <div></div> <div></div> <div></div> " session <- minimal_selenider_session(html) ss("div") |> length()
html <- " <div></div> <div></div> <div></div> <div></div> " session <- minimal_selenider_session(html) ss("div") |> length()
If an element is an ancestor of a form, submits the form.
Works by walking up the DOM, checking each ancestor element until
the element is a <form>
element, which it then submits. If such
an element does not exist, an error is thrown.
elem_submit(x, js = FALSE, timeout = NULL)
elem_submit(x, js = FALSE, timeout = NULL)
x |
A |
js |
Whether to submit the form using JavaScript. |
timeout |
How long to wait for the element to exist. |
x
, invisibly.
Other actions:
elem_click()
,
elem_hover()
,
elem_scroll_to()
,
elem_select()
,
elem_set_value()
html <- " <form> <input type='submit'> <p>Random text</p> </form> <a>Random link</a> " session <- minimal_selenider_session(html) elem_submit(s("input")) elem_submit(s("p")) # Won't work since the element doesn't have a form ancestor try(elem_submit(s("a"), timeout = 0.5))
html <- " <form> <input type='submit'> <p>Random text</p> </form> <a>Random link</a> " session <- minimal_selenider_session(html) elem_submit(s("input")) elem_submit(s("p")) # Won't work since the element doesn't have a form ancestor try(elem_submit(s("a"), timeout = 0.5))
Get the inner text of a selenider_element
object.
elem_text(x, timeout = NULL)
elem_text(x, timeout = NULL)
x |
A |
timeout |
The time to wait for |
A string.
Other properties:
elem_attr()
,
elem_css_property()
,
elem_name()
,
elem_size()
html <- " <p>Example text</p> " session <- minimal_selenider_session(html) s("p") |> elem_text()
html <- " <p>Example text</p> " session <- minimal_selenider_session(html) s("p") |> elem_text()
Execute a JavaScript function on zero or more arguments.
execute_js_expr()
is a simpler version of execute_js_fn()
that can
evaluate simple expressions (e.g. "alert()"). To return a value, you must
do so explicitly using "return".
These functions are experimental because their names and parameters are liable to change. Additionally, their behaviour can be inconsistent between different session types (chromote and selenium) and different browsers.
execute_js_fn(fn, ..., .timeout = NULL, .session = NULL, .debug = FALSE) execute_js_expr(expr, ..., .timeout = NULL, .session = NULL, .debug = FALSE)
execute_js_fn(fn, ..., .timeout = NULL, .session = NULL, .debug = FALSE) execute_js_expr(expr, ..., .timeout = NULL, .session = NULL, .debug = FALSE)
fn |
A string defining the function. |
... |
Arguments to the function/expression. These must be unnamed, since JavaScript does not support named arguments. |
.timeout |
How long to wait for any elements to exist in the DOM. |
.session |
The session to use, if |
.debug |
Whether to print the final expression that is executed. Mostly used for debugging the functions themselves, but can also be used to identify problems in your own JavaScript code. |
expr |
An expression to execute. |
...
can contain selenider_element
/selenider_elements
objects,
which will be collected and then passed into the function. However,
more complex objects (e.g. lists of selenider elements) will not be
moved into the JavaScript world correctly.
Similarly, nodes and lists of nodes returned from a JavaScript function will
be converted into their corresponding
selenider_element
/selenider_elements
objects, while more complex objects
will not. These elements are not lazy (see elem_cache()
), so make sure you
only use them while you are sure they are still on the page.
The return value of the JavaScript function, turned back into an R object.
Other global actions:
back()
,
current_url()
,
get_page_source()
,
open_url()
,
reload()
,
scroll_to()
,
take_screenshot()
html <- " <button class='mybutton'>Click me</button> " session <- minimal_selenider_session(html) execute_js_fn("(x, y) => x + y", 1, 1) execute_js_expr("arguments[0] + arguments[1]", 1, 1) execute_js_fn("x => x.click()", s(".mybutton")) execute_js_expr("arguments[0].click()", s(".mybutton"))
html <- " <button class='mybutton'>Click me</button> " session <- minimal_selenider_session(html) execute_js_fn("(x, y) => x + y", 1, 1) execute_js_expr("arguments[0] + arguments[1]", 1, 1) execute_js_fn("x => x.click()", s(".mybutton")) execute_js_expr("arguments[0].click()", s(".mybutton"))
Find HTML child elements from elements in a collection. Provides a convenient way to operate on a collection of elements.
find_each_element()
finds the first child element of each element in
the collection.
find_all_elements()
finds every child element of every element in the
collection.
find_each_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL ) find_all_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL )
find_each_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL ) find_all_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL )
x |
A |
css |
A CSS selector. |
xpath |
An XPath. |
id |
The id of the elements you want to select. |
class_name |
The class name of the elements you want to select. |
name |
The name attribute of the elements you want to select. |
find_each_element()
will usually preserve the length of the input, since
for each element in the collection, one new element will be found. However,
if an element in the collection cannot be found, it will not be included in
the resulting collection.
find_each_element(x, ...)
is roughly equivalent to:
x |> as.list() |> lapply(\(x) find_element(x, ...)) |> elem_flatten()
Similarly, find_all_elements(x, ...)
is roughly equivalent to:
x |> as.list() |> lapply(\(x) find_elements(x, ...)) |> elem_flatten()
A selenider_elements
object.
as.list()
to iterate over an element collection.
elem_flatten()
to combine multiple
selenider_element
/selenider_elements
objects into a single object.
html <- " <div id='div1'> <p>Text 1</p> <button>Button 1</button> </div> <div id='div2'> <p>Text 2</p> </div> <div id='div3'> <p>Text 3</p> </div> <div id='div4'> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) divs <- ss("div") # Get the <p> tag inside each div. divs |> find_each_element("p") # Get the <button> tag in the first div as well. divs |> find_all_elements("*")
html <- " <div id='div1'> <p>Text 1</p> <button>Button 1</button> </div> <div id='div2'> <p>Text 2</p> </div> <div id='div3'> <p>Text 3</p> </div> <div id='div4'> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) divs <- ss("div") # Get the <p> tag inside each div. divs |> find_each_element("p") # Get the <button> tag in the first div as well. divs |> find_all_elements("*")
Find the first HTML element using a CSS selector, an XPath, or a variety of other methods.
find_element(x, ...) ## S3 method for class 'selenider_session' find_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... ) ## S3 method for class 'selenider_element' find_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... )
find_element(x, ...) ## S3 method for class 'selenider_session' find_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... ) ## S3 method for class 'selenider_element' find_element( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... )
x |
A selenider session or element. |
... |
Arguments passed to methods. |
css |
A css selector. |
xpath |
An XPath. |
id |
The id of the element you want to select. |
class_name |
The class name of the element you want to select. |
name |
The name attribute of the element you want to select. |
If more than one method is used to select an element (e.g. css
and
xpath
), the first element which satisfies all conditions will be found.
CSS selectors are generally recommended over other options, since they are
usually the easiest to read. Use "tag_name"
to select by tag name,
".class"
to select by class, and "#id"
to select by id.
A selenider_element
object.
s()
to quickly select an element without specifying the session.
find_elements()
to select multiple elements.
selenider_session()
to begin a session.
html <- " <div class='class1'> <div id='id1'> <p class='class2'>Example text</p> </div> <p>Example text</p> </div> " session <- minimal_selenider_session(html) session |> find_element("div") session |> find_element("div") |> find_element(xpath = "./p") s("div") |> find_element("#id1") s("div") |> find_element(id = "id1") |> find_element(class_name = "class2") s(xpath = "//div[contains(@class, 'class1')]/div/p") # Complex Xpath expressions are easier to read as chained CSS selectors. # This is equivalent to above s("div.class1") |> find_element("div") |> find_element("p")
html <- " <div class='class1'> <div id='id1'> <p class='class2'>Example text</p> </div> <p>Example text</p> </div> " session <- minimal_selenider_session(html) session |> find_element("div") session |> find_element("div") |> find_element(xpath = "./p") s("div") |> find_element("#id1") s("div") |> find_element(id = "id1") |> find_element(class_name = "class2") s(xpath = "//div[contains(@class, 'class1')]/div/p") # Complex Xpath expressions are easier to read as chained CSS selectors. # This is equivalent to above s("div.class1") |> find_element("div") |> find_element("p")
Find every available HTML element using a CSS selector, an XPath, or a variety of other methods.
find_elements(x, ...) ## S3 method for class 'selenider_session' find_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... ) ## S3 method for class 'selenider_element' find_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... )
find_elements(x, ...) ## S3 method for class 'selenider_session' find_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... ) ## S3 method for class 'selenider_element' find_elements( x, css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL, ... )
x |
A selenider session or element. |
... |
Arguments passed to methods. |
css |
A css selector. |
xpath |
An XPath. |
id |
The id of the element you want to select. |
class_name |
The class name of the element you want to select. |
name |
The name attribute of the element you want to select. |
If more than one method is used to select an element (e.g. css
and
xpath
), the first element which satisfies every condition will be found.
A selenider_elements
object. Note that this is not a list, and you should
be careful with the functions that you use with it. See the advanced usage
vignette for more details:
vignette("advanced-usage", package = "selenider")
.
ss()
to quickly select multiple elements without specifying the session.
find_element()
to select a single element.
selenider_session()
to begin a session.
elem_children()
and family to select elements using their relative
position in the DOM.
elem_filter()
and elem_find()
for filtering element collections.
as.list.selenider_elements()
to convert a selenider_elements
object
to a list.
html <- " <div id='outer-div'> <div> <p>Text 1</p> <p>Text 2</p> <p>Text 3</p> </div> </div> <div></div> " session <- minimal_selenider_session(html) session |> find_elements("div") # Or: ss("div") session |> find_element("#outer-div") |> find_elements("p") # The above can be shortened to: s("#outer-div") |> find_elements("p")
html <- " <div id='outer-div'> <div> <p>Text 1</p> <p>Text 2</p> <p>Text 3</p> </div> </div> <div></div> " session <- minimal_selenider_session(html) session |> find_elements("div") # Or: ss("div") session |> find_element("#outer-div") |> find_elements("p") # The above can be shortened to: s("#outer-div") |> find_elements("p")
Turn a lazy selenium element or element collection into a backendNodeId (chromote) or a selenium::WebElement. Use this to perform certain actions on the element that are not implemented in selenider.
get_actual_element()
turns a selenider_element
object into a single
backendNodeId or selenium::WebElement object. The function will wait for
the object to exist in the DOM.
get_actual_elements()
turns a selenider_elements
object into a list
of selenium::WebElement objects, waiting for any parent objects to
exist in the DOM.
get_actual_element(x, timeout = NULL) get_actual_elements(x, timeout = NULL)
get_actual_element(x, timeout = NULL) get_actual_elements(x, timeout = NULL)
x |
A |
timeout |
The timeout to use while asserting that the item exists. If
NULL, the timeout of the |
An integer (backendNodeId), or a selenium::WebElement object.
get_actual_elements()
returns a list of such objects.
s()
, ss()
, find_element()
and find_elements()
to select selenider
elements.
elem_cache()
and elem_cache()
to cache these values.
The Chrome Devtools Protocol documentation
for the operations that can be performed using a backend node id. Note
that this requires the chromote::ChromoteSession object, which can be
retrieved using <selenider_session>$driver
.
The documentation for selenium::WebElement()
to see the things you can
do with a webElement.
html <- " <div> <p>Text</p> <p>More text</p> </div> " session <- minimal_selenider_session(html) elem <- s("div") |> get_actual_element() # The ChromoteSession/SeleniumSession can be accessed using session$driver driver <- session$driver if (inherits(driver, "ChromoteSession")) { driver$DOM$getBoxModel(backendNodeId = elem) } else if (inherits(elem, "WebElement")) { elem$get_rect() } elems <- ss("p") |> get_actual_elements() if (inherits(driver, "ChromoteSession")) { driver$DOM$describeNode(backendNodeId = elems[[1]]) } else if (inherits(elems[[1]], "WebElement")) { elems[[1]]$get_rect() }
html <- " <div> <p>Text</p> <p>More text</p> </div> " session <- minimal_selenider_session(html) elem <- s("div") |> get_actual_element() # The ChromoteSession/SeleniumSession can be accessed using session$driver driver <- session$driver if (inherits(driver, "ChromoteSession")) { driver$DOM$getBoxModel(backendNodeId = elem) } else if (inherits(elem, "WebElement")) { elem$get_rect() } elems <- ss("p") |> get_actual_elements() if (inherits(driver, "ChromoteSession")) { driver$DOM$describeNode(backendNodeId = elems[[1]]) } else if (inherits(elems[[1]], "WebElement")) { elems[[1]]$get_rect() }
Uses xml2::read_html()
to read the page source of the session
get_page_source(session = NULL, ...)
get_page_source(session = NULL, ...)
session |
Optionally, a |
... |
Passed into |
An XML document.
Other global actions:
back()
,
current_url()
,
execute_js_fn()
,
open_url()
,
reload()
,
scroll_to()
,
take_screenshot()
html <- " <p>Example text</p> " session <- minimal_selenider_session(html) get_page_source()
html <- " <p>Example text</p> " session <- minimal_selenider_session(html) get_page_source()
Change the locally defined selenider_session()
object, allowing it to be
used in functions like s()
without explicitly providing it.
get_session()
retrieves the current local session. If none have been
created, a session is created automatically.
local_session()
sets the local session. The function uses withr::defer()
to make sure the session is closed and the local session is set to its
previous value when it is no longer needed.
with_session()
runs some code with a temporary local session. The session
is closed and the local session is set to its previous value when the code
finishes executing.
get_session(create = TRUE, .env = rlang::caller_env()) local_session(session, .local_envir = rlang::caller_env(), close = TRUE) with_session(session, code, close = TRUE)
get_session(create = TRUE, .env = rlang::caller_env()) local_session(session, .local_envir = rlang::caller_env(), close = TRUE) with_session(session, code, close = TRUE)
create |
If a session is not found, should we create a new one? If this
is |
.env |
If |
session |
The |
.local_envir |
The environment where the session is being used. When the function associated with this environment finishes execution, the session will be reset. |
close |
Should we close |
code |
The code to run with the local session set. |
Use withr::deferred_run()
to reset any local sessions set using
local_session()
.
get_session()
returns the local selenider_session()
object (or a newly
created session).
local_session()
returns the previous local session object (or NULL
).
This is the same as running get_session()
before this function.
with_session()
returns the result of code
.
selenider_session()
, which calls local_session()
unless otherwise
specified.
# Don't set the local session, since we want to do it manually. session <- selenider_session(local = FALSE) get_session(create = FALSE) # NULL local_session(session, close = FALSE) get_session(create = FALSE) withr::deferred_run() get_session(create = FALSE) # NULL # By default, the local session is only set inside the function that it is # called. # If we want to set the local session outside the scope of a function, we # need to use the `.local_envir` argument. set_my_session <- function(env = rlang::caller_env()) { # caller_env() is the environment where the function is called. local_session(session, .local_envir = env, close = FALSE) } set_my_session() with_session( session, { get_session(create = FALSE) }, close = FALSE ) get_session(create = FALSE)
# Don't set the local session, since we want to do it manually. session <- selenider_session(local = FALSE) get_session(create = FALSE) # NULL local_session(session, close = FALSE) get_session(create = FALSE) withr::deferred_run() get_session(create = FALSE) # NULL # By default, the local session is only set inside the function that it is # called. # If we want to set the local session outside the scope of a function, we # need to use the `.local_envir` argument. set_my_session <- function(env = rlang::caller_env()) { # caller_env() is the environment where the function is called. local_session(session, .local_envir = env, close = FALSE) } set_my_session() with_session( session, { get_session(create = FALSE) }, close = FALSE ) get_session(create = FALSE)
has_attr()
checks that an element's attribute matches a value, while
attr_contains()
checks that an element's attribute contains a value.
has_value()
is a shortcut for has_attr("value")
: it checks that an
element's value matches a string or number.
has_attr(x, name, value) attr_contains(x, name, value) has_value(x, value)
has_attr(x, name, value) attr_contains(x, name, value) has_value(x, value)
x |
A |
name |
The name of the attribute. |
value |
The value of the attribute. For |
A boolean value: TRUE or FALSE.
Other conditions:
has_css_property()
,
has_length()
,
has_name()
,
has_text()
,
is_enabled()
,
is_present()
,
is_visible()
html <- " <input class='myclass' value='1.0' data-customattr='Custom attribute text'> " session <- minimal_selenider_session(html) has_attr(s("input"), "class", "myclass") has_attr(s("input"), "value", 1) has_value(s("input"), 1) attr_contains(s("input"), "data-customattr", "Custom attribute")
html <- " <input class='myclass' value='1.0' data-customattr='Custom attribute text'> " session <- minimal_selenider_session(html) has_attr(s("input"), "class", "myclass") has_attr(s("input"), "value", 1) has_value(s("input"), 1) attr_contains(s("input"), "data-customattr", "Custom attribute")
Check that the CSS property (e.g. "background-color"
) of an element matches
a value.
has_css_property(x, property, value)
has_css_property(x, property, value)
x |
A |
property |
The name of the CSS property |
value |
The value of the attribute. |
A boolean value: TRUE or FALSE.
Other conditions:
has_attr()
,
has_length()
,
has_name()
,
has_text()
,
is_enabled()
,
is_present()
,
is_visible()
html <- " <div style='display:none;'></div> " session <- minimal_selenider_session(html) has_css_property(s("div"), "display", "none")
html <- " <div style='display:none;'></div> " session <- minimal_selenider_session(html) has_css_property(s("div"), "display", "none")
has_length()
and has_size()
checks that a collection of HTML elements
contains a certain number of elements.
has_length(x, n) has_size(x, n) has_at_least(x, n)
has_length(x, n) has_size(x, n) has_at_least(x, n)
x |
A |
n |
A numeric vector of possible lengths of |
has_at_least()
checks that a collection contains at least n
elements.
These functions do not implement a retry mechanism, and only test a condition
once. Use elem_expect()
or elem_wait_until()
to use these conditions in
tests.
A boolean value: TRUE
or FALSE
Other conditions:
has_attr()
,
has_css_property()
,
has_name()
,
has_text()
,
is_enabled()
,
is_present()
,
is_visible()
html <- " <div class='div1'></div> <div class='div2'></div> <div class='div3'></div> " session <- minimal_selenider_session(html) has_length(ss("div"), 3) has_at_least(ss("div"), 2)
html <- " <div class='div1'></div> <div class='div2'></div> <div class='div3'></div> " session <- minimal_selenider_session(html) has_length(ss("div"), 3) has_at_least(ss("div"), 2)
Check that an element has a specified tag name
has_name(x, name)
has_name(x, name)
x |
A |
name |
A string. |
A boolean value.
Other conditions:
has_attr()
,
has_css_property()
,
has_length()
,
has_text()
,
is_enabled()
,
is_present()
,
is_visible()
html <- " <div id='mydiv'></div> " session <- minimal_selenider_session(html) has_name(s("#mydiv"), "p") has_name(s("#mydiv"), "div")
html <- " <div id='mydiv'></div> " session <- minimal_selenider_session(html) has_name(s("#mydiv"), "p") has_name(s("#mydiv"), "div")
has_text()
checks that an element's inner text contains a string, while
has_exact_text()
checks that the inner text only contains the string.
Both functions throw an error if the element does not exist in the DOM.
has_text(x, text) has_exact_text(x, text)
has_text(x, text) has_exact_text(x, text)
x |
A |
text |
A string, used to test the element's inner text. |
These functions do not implement a retry mechanism, and only test a condition
once. Use elem_expect()
or elem_wait_until()
to use these conditions in
tests.
A boolean value: TRUE or FALSE.
Other conditions:
has_attr()
,
has_css_property()
,
has_length()
,
has_name()
,
is_enabled()
,
is_present()
,
is_visible()
html <- " <p>Example text</p> <p class='empty'></p> " session <- minimal_selenider_session(html) has_text(s("p"), "Example") # TRUE has_exact_text(s("p"), "Example") # FALSE has_exact_text(s("p"), "Example text") # TRUE # has_exact_text() is useful for checking when there is no text, # since has_text("") will always be TRUE. has_exact_text(s(".empty"), "")
html <- " <p>Example text</p> <p class='empty'></p> " session <- minimal_selenider_session(html) has_text(s("p"), "Example") # TRUE has_exact_text(s("p"), "Example") # FALSE has_exact_text(s("p"), "Example text") # TRUE # has_exact_text() is useful for checking when there is no text, # since has_text("") will always be TRUE. has_exact_text(s(".empty"), "")
is_disabled()
checks that an element has the disabled
attribute set to
TRUE
, while is_enabled()
checks that it does not. Both functions throw an
error if the element does not exist in the DOM.
is_enabled(x) is_disabled(x)
is_enabled(x) is_disabled(x)
x |
A |
These functions do not implement a retry mechanism, and only test a condition
once. Use elem_expect()
or elem_wait_until()
to use these conditions in
tests.
A boolean value: TRUE or FALSE.
Other conditions:
has_attr()
,
has_css_property()
,
has_length()
,
has_name()
,
has_text()
,
is_present()
,
is_visible()
html <- " <button></button> <button disabled></button> " session <- minimal_selenider_session(html) is_enabled(s("button")) # TRUE is_disabled(ss("button")[[2]]) # TRUE
html <- " <button></button> <button disabled></button> " session <- minimal_selenider_session(html) is_enabled(s("button")) # TRUE is_disabled(ss("button")[[2]]) # TRUE
is_present()
and is_in_dom()
checks if an element is present on the page,
while is_missing()
and is_absent()
checks the opposite.
is_present(x) is_in_dom(x) is_absent(x)
is_present(x) is_in_dom(x) is_absent(x)
x |
A |
These functions do not implement a retry mechanism, and only test a condition
once. Use elem_expect()
or elem_wait_until()
to use these conditions in
tests.
A boolean value: TRUE or FALSE.
Other conditions:
has_attr()
,
has_css_property()
,
has_length()
,
has_name()
,
has_text()
,
is_enabled()
,
is_visible()
html <- " <p class='class1'></p> " session <- minimal_selenider_session(html) is_present(s(".class1")) # TRUE is_in_dom(s(".class2")) # FALSE is_absent(s(".class2")) # TRUE
html <- " <p class='class1'></p> " session <- minimal_selenider_session(html) is_present(s(".class1")) # TRUE is_in_dom(s(".class2")) # FALSE is_absent(s(".class2")) # TRUE
is_visible()
and is_displayed()
checks that an element can be seen on the
page, while is_invisible()
and is_hidden()
checks the opposite. All
functions throw an error if the element is not in the DOM.
is_visible(x) is_displayed(x) is_hidden(x) is_invisible(x)
is_visible(x) is_displayed(x) is_hidden(x) is_invisible(x)
x |
A |
These functions do not implement a retry mechanism, and only test a condition
once. Use elem_expect()
or elem_wait_until()
to use these conditions in
tests.
A boolean value: TRUE or FALSE.
Other conditions:
has_attr()
,
has_css_property()
,
has_length()
,
has_name()
,
has_text()
,
is_enabled()
,
is_present()
html <- " <div style='visibility:hidden;'>Content 1</div> <div style='display:none'>Content 2</div> <div>Content 3</div> " session <- minimal_selenider_session(html) is_visible(s("div")) # FALSE is_invisible(ss("div")[[2]]) # TRUE is_visible(ss("div")[[3]]) # TRUE
html <- " <div style='visibility:hidden;'>Content 1</div> <div style='display:none'>Content 2</div> <div>Content 3</div> " session <- minimal_selenider_session(html) is_visible(s("div")) # FALSE is_invisible(ss("div")[[2]]) # TRUE is_visible(ss("div")[[3]]) # TRUE
List of special keys, for use with elem_send_keys()
.
keys
keys
A list containing selenider_key
objects.
keys$backspace
keys$backspace
Create a selenider_session
using custom HTML/JavaScript.
minimal_selenider_session(html, js = NULL, ..., .env = rlang::caller_env())
minimal_selenider_session(html, js = NULL, ..., .env = rlang::caller_env())
html |
A string to use as HTML. Can also be an |
js |
A string (or |
... |
Passed into |
.env |
The environment in which the session will be used. |
The function works by combining html
and js
into a single string, then
writing this to a temporary file (and opening it in the session's browser).
A selenider_session
object.
session <- minimal_selenider_session("<p>Example</p>")
session <- minimal_selenider_session("<p>Example</p>")
Navigate the browser to specified URL, waiting until the page is considered open before finishing.
open_url(url, timeout = NULL, session = NULL)
open_url(url, timeout = NULL, session = NULL)
url |
The URL to navigate to: a string. |
timeout |
The maximum time to wait for the page to load, in seconds. This defaults to 60, unless in a Github Action, in which case it defaults to 5 minutes. |
session |
A |
The session object, invisibly.
Other global actions:
back()
,
current_url()
,
execute_js_fn()
,
get_page_source()
,
reload()
,
scroll_to()
,
take_screenshot()
session <- selenider_session() open_url("https://r-project.org") # Or: open_url(session = session, "https://r-project.org")
session <- selenider_session() open_url("https://r-project.org") # Or: open_url(session = session, "https://r-project.org")
Display a summary of the steps needed to reach an element. This function is deprecated, as it is not useful for most users.
print_lazy(x, ...) ## S3 method for class 'selenider_element' print_lazy(x, ...) ## S3 method for class 'selenider_elements' print_lazy(x, ...)
print_lazy(x, ...) ## S3 method for class 'selenider_element' print_lazy(x, ...) ## S3 method for class 'selenider_elements' print_lazy(x, ...)
x |
A |
... |
Not used. |
x
, invisibly.
Display an element or collection of elements by fetching the elements and displaying their HTML contents.
## S3 method for class 'selenider_element' print(x, width = getOption("width"), ..., timeout = NULL) ## S3 method for class 'selenider_elements' print(x, width = getOption("width"), ..., n = 20, timeout = NULL)
## S3 method for class 'selenider_element' print(x, width = getOption("width"), ..., timeout = NULL) ## S3 method for class 'selenider_elements' print(x, width = getOption("width"), ..., n = 20, timeout = NULL)
x |
A |
width |
The maximum width of the output. |
... |
Not used. |
timeout |
How long to wait for |
n |
The maximum number of elements to print. |
x
, invisibly.
html <- " <div> <p>Text 1</p> <p>Text 2</p> <p>Text 3</p> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) print(s("div")) print(ss("p")) print(ss("p"), n = 3)
html <- " <div> <p>Text 1</p> <p>Text 2</p> <p>Text 3</p> <p>Text 4</p> </div> " session <- minimal_selenider_session(html) print(s("div")) print(ss("p")) print(ss("p"), n = 3)
xml2::read_html()
can be used on a selenider session to read the HTML of
the entire page, or on a selenider element to get the HTML of that element.
read_html.selenider_session( x, encoding = "", ..., options = c("RECOVER", "NOERROR", "NOBLANKS") ) read_html.selenider_element( x, encoding = "", timeout = NULL, outer = TRUE, ..., options = c("RECOVER", "NOERROR", "NOBLANKS") )
read_html.selenider_session( x, encoding = "", ..., options = c("RECOVER", "NOERROR", "NOBLANKS") ) read_html.selenider_element( x, encoding = "", timeout = NULL, outer = TRUE, ..., options = c("RECOVER", "NOERROR", "NOBLANKS") )
x |
A |
encoding , ... , options
|
Passed into |
timeout |
How long to wait for |
outer |
Whether to read the inner (all children of the current element)
or outer (including the element itself) HTML of |
read_html()
returns an XML document. Note that HTML will always be wrapped
in a <html>
and <body>
tag, if it isn't already.
library(rvest) html <- " <div> <p>Example text</p> </div> " session <- minimal_selenider_session(html) read_html(session) read_html(s("div"))
library(rvest) html <- " <div> <p>Example text</p> </div> " session <- minimal_selenider_session(html) read_html(session) read_html(s("div"))
reload()
and refresh()
both reload the current page.
reload(timeout = NULL, session = NULL) refresh(timeout = NULL, session = NULL)
reload(timeout = NULL, session = NULL) refresh(timeout = NULL, session = NULL)
timeout |
The maximum time to wait for the page to load, in seconds. This defaults to 60, unless in a Github Action, in which case it defaults to 5 minutes. |
session |
A |
The session object, invisibly.
Other global actions:
back()
,
current_url()
,
execute_js_fn()
,
get_page_source()
,
open_url()
,
scroll_to()
,
take_screenshot()
session <- selenider_session() open_url("https://r-project.org") reload()
session <- selenider_session() open_url("https://r-project.org") reload()
Both s()
and ss()
allow you to select elements without specifying a
session object.
s()
selects a single element, being a shorthand for find_element()
on the current session.
ss()
selects multiple elements, being a shorthand for find_elements()
.
s(css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL) ss(css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL)
s(css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL) ss(css = NULL, xpath = NULL, id = NULL, class_name = NULL, name = NULL)
css |
A css selector. |
xpath |
An XPath. |
id |
The id of the element you want to select. |
class_name |
The class name of the element you want to select. |
name |
The name attribute of the element you want to select. |
Both functions allow the starting point for chains of selectors to be made
more concise. Both use get_session()
to get the global session object.
If you want to pass in a session, use find_element()
/find_elements()
instead.
s()
returns a selenider_element
object.
ss()
returns a selenider_elements
object. Note that this is not a list,
and you should be careful with the functions that you use with it. See the
advanced usage vignette for more details:
vignette("advanced-usage", package = "selenider")
.
selenider_session()
to begin a session.
html <- " <div> <p id='id1' class='inner'></p> <div class='child'> <p class='inner'></p> </div> </div> " session <- minimal_selenider_session(html) s("#id1") # This is the equivalent of: find_element(session, "#id1") ss(".inner") # This is the equivalent of: find_element(session, ".inner") # This provides a more concise way to begin a chain of selectors s("div") |> find_element(".child") |> find_element(".inner")
html <- " <div> <p id='id1' class='inner'></p> <div class='child'> <p class='inner'></p> </div> </div> " session <- minimal_selenider_session(html) s("#id1") # This is the equivalent of: find_element(session, "#id1") ss(".inner") # This is the equivalent of: find_element(session, ".inner") # This provides a more concise way to begin a chain of selectors s("div") |> find_element(".child") |> find_element(".inner")
scroll_to()
scrolls the page to the specified coordinates.
scroll_by()
scrolls the page by the specified amount.
scroll_to(top = 0, left = 0, session = NULL) scroll_by(top = 0, left = 0, session = NULL)
scroll_to(top = 0, left = 0, session = NULL) scroll_by(top = 0, left = 0, session = NULL)
top |
The vertical scroll position/offset, in pixels. |
left |
The horizontal scroll position/offset, in pixels. |
session |
A |
Other global actions:
back()
,
current_url()
,
execute_js_fn()
,
get_page_source()
,
open_url()
,
reload()
,
take_screenshot()
session <- selenider_session() open_url("https://r-project.org") scroll_to(100, 100) # Scrolls an additional 100 pixels down scroll_by(100)
session <- selenider_session() open_url("https://r-project.org") scroll_to(100, 100) # Scrolls an additional 100 pixels down scroll_by(100)
Checks if selenider's dependencies are available, and that we are in an environment where it makes sense to open a selenider session.
skip_if_selenider_unavailable()
skips a testthat test if
selenider_available()
returns FALSE
.
selenider_available(session = c("chromote", "selenium"), online = TRUE) skip_if_selenider_unavailable(session = c("chromote", "selenium"))
selenider_available(session = c("chromote", "selenium"), online = TRUE) skip_if_selenider_unavailable(session = c("chromote", "selenium"))
session |
Which session we should check. |
online |
Whether we need to check for an internet connection. |
Specifically, the following is checked:
The SELENIDER_AVAILABLE
environment variable. Set this to "TRUE"
or
"FALSE"
to override the return value of this function.
Whether we are on CRAN (using the NOT_CRAN
environment variable). If we
are, the function returns FALSE
.
Whether an internet connection is available (using curl::nslookup()
).
If session
is "chromote"
, we also check:
Whether chromote
is installed.
Whether chromote::find_chrome()
does not error.
If session
is "selenium"
, we check:
Whether selenium
is installed.
Whether we can find a valid browser that is supported by selenium
.
A boolean flag: TRUE
or FALSE
.
selenider_available()
selenider_available()
Create a session in selenider. If you create a session in the global
environment it will be made available to other functions (like
open_url()
) without having to pass it in, and will close automatically
when the R session is closed. Alternatively, if it is created inside a
function, it will be closed as soon as the function finishes executing. To
customise this, use the .env
and local
arguments.
selenider_session( session = getOption("selenider.session"), browser = getOption("selenider.browser"), timeout = 4, options = NULL, driver = NULL, local = TRUE, .env = rlang::caller_env(), view = FALSE, selenium_manager = TRUE, quiet = TRUE )
selenider_session( session = getOption("selenider.session"), browser = getOption("selenider.browser"), timeout = 4, options = NULL, driver = NULL, local = TRUE, .env = rlang::caller_env(), view = FALSE, selenium_manager = TRUE, quiet = TRUE )
session |
The package to use as a backend: either "chromote",
"selenium". By default, chromote is used, since this tends to be faster
and more reliable. Change the default value using the |
browser |
The name of the browser to run the session in; one of
"chrome", "firefox", "edge", "safari", or another valid browser name.
If |
timeout |
The default time to wait when collecting an element. |
options |
A |
driver |
A driver object to use instead of creating one manually. This can be one of:
|
local |
Whether to set the session as the local session object,
using |
.env |
Passed into |
view , selenium_manager , quiet
|
A selenider_session
object. Use session$driver
to retrieve the driver
object that controls the browser.
See chromote_options()
and selenium_options()
for the full range.
By default, chromote will run in headless mode, meaning that you won't
actually be able to see the browser as you control it. Use the view
argument to chromote_options()
to change this:
session <- selenider_session( options = chromote_options(view = TRUE) )
Sometimes, you want to manage the Selenium server separately, and only let
selenider create client objects to attach to the server. You can do this by
passing NULL
into the server_options
argument to selenium_options()
:
session <- selenider_session( "selenium", options = selenium_options(server_options = NULL) )
If the port you are using is not 4444, you will need to pass in the port
argument to selenium_client_options()
as well:
session <- selenider_session( "selenium", options = selenium_options( client_options = selenium_client_options(port = YOUR_PORT), server_options = NULL ) )
One example of when this may be useful is when you are managing the Selenium server using Docker.
By default, selenium will download and store the Selenium server JAR file
in a temporary directory, which will be deleted when the R session finishes.
This means that every time you start a new R session, this file will be
re-downloaded. You can store the JAR file permanently using the temp
argument to selenium_server_options()
:
session <- selenider_session( "selenium", options = selenium_options( server_options = selenium_server_options(temp = TRUE) ) )
The downside of this is you may end up using a lot of storage, especially if a new version of Selenium is released and the old server file is left on the filesystem.
You can also use the path
argument to selenium_server_options()
to
specify the directory where the JAR file should be stored.
A selenider_session
object has several components that can be useful to
access:
session
- The type of session, either "chromote"
or "selenium"
.
driver
- The driver object used to control the browser. This is either a
chromote::ChromoteSession or selenium::SeleniumSession object. This is
useful if you want to do something with the driver that is not directly
supported by selenider. See get_actual_element()
for some examples of
this.
server
- The Selenium server object, if one was created or passed in.
id
- A unique ID that can be used to identify the session.
Access these components using $
(e.g. session$driver
).
If you want complete manual control over creating the underlying driver,
you can pass your own driver
argument to stop selenider from creating
the driver for you.
You can also supply a shinytest2::AppDriver object, allowing selenider and shinytest2 to share a session:
shiny_app <- shiny::shinyApp( ui = shiny::fluidPage( # ... Your UI ), server = function(input, output) { # ... Your server } ) app <- shinytest2::AppDriver$new() session <- selenider_session( driver = app )
close_session()
to close the session. Note that this will not reset the
result of get_session()
, which is why withr::deferred_run()
is
preferred.
local_session()
and with_session()
to manually set the local session
object (and get_session()
to get it).
open_url()
, s()
and find_elements()
to get started once you have
created a session.
session_1 <- selenider_session(timeout = 10) # session_1 is the local session here get_session() # Returns session 1 my_function <- function() { session_2 <- selenider_session() # In here, session_2 is the local session get_session() } # When the function finishes executing, the session is closed my_function() # Returns `session_2` # If we want to use a session outside the scope of a function, # we need to use the `.env` argument. create_session <- function(timeout = 10, .env = rlang::caller_env()) { # caller_env() is the environment where the function is called selenider_session(timeout = timeout, .env = .env) } my_session <- create_session() # We can now use this session outside the `create_session()` function get_session() # `my_session` will be closed automatically.
session_1 <- selenider_session(timeout = 10) # session_1 is the local session here get_session() # Returns session 1 my_function <- function() { session_2 <- selenider_session() # In here, session_2 is the local session get_session() } # When the function finishes executing, the session is closed my_function() # Returns `session_2` # If we want to use a session outside the scope of a function, # we need to use the `.env` argument. create_session <- function(timeout = 10, .env = rlang::caller_env()) { # caller_env() is the environment where the function is called selenider_session(timeout = timeout, .env = .env) } my_session <- create_session() # We can now use this session outside the `create_session()` function get_session() # `my_session` will be closed automatically.
selenider
has a few options, allowing you to specify the session and
browser to use without having to tell selenider_session()
this information
every time.
selenider.session
- The package to use as a backend: either "chromote",
"selenium" or "rselenium".
selenider.browser
- The name of the browser to run the session in; one
of "chrome", "firefox", "edge", "safari", or another valid browser name.
Take a screenshot of the current session state, saving this image to a file.
take_screenshot(file = NULL, view = FALSE, session = NULL)
take_screenshot(file = NULL, view = FALSE, session = NULL)
file |
The file path to save the screenshot to. |
view |
Whether to open the interactively view the screenshot. If this is
|
session |
A |
file
, if it is not NULL
. Otherwise, the session object is returned, invisibly.
Other global actions:
back()
,
current_url()
,
execute_js_fn()
,
get_page_source()
,
open_url()
,
reload()
,
scroll_to()
session <- selenider_session() open_url("https://www.google.com") file_path <- withr::local_tempfile(fileext = ".png") take_screenshot(file_path)
session <- selenider_session() open_url("https://www.google.com") file_path <- withr::local_tempfile(fileext = ".png") take_screenshot(file_path)
Instruct selenider to use RSelenium instead of selenium. Passed into
selenium_options()
. This is not recommended, since RSelenium does not
support the latest version of Selenium, and wdman (the server manager that
RSelenium) uses, is not compatible with the latest version of Chrome.
wdman_server_options( version = "latest", driver_version = "latest", port = 4444L, check = TRUE, verbose = FALSE, retcommand = FALSE, ... ) rselenium_client_options( port = 4444L, host = "localhost", path = "/wd/hub", version = "", platform = "ANY", javascript = TRUE, native_events = TRUE, extra_capabilities = list() )
wdman_server_options( version = "latest", driver_version = "latest", port = 4444L, check = TRUE, verbose = FALSE, retcommand = FALSE, ... ) rselenium_client_options( port = 4444L, host = "localhost", path = "/wd/hub", version = "", platform = "ANY", javascript = TRUE, native_events = TRUE, extra_capabilities = list() )
version |
The version of Selenium server to use. |
driver_version |
The version of the browser-specific driver to use. |
port |
The port to run selenium client/server on. |
check , verbose , retcommand , ...
|
Passed into |
host |
The host to connect to the selenium server over. |
path , platform , javascript , native_events , extra_capabilities
|
Passed into |
In selenium_options()
, you can supply options to configure a server and
client run by RSelenium instead of selenium.
Instead of selenium_server_options()
, you can use wdman_server_options()
to allow wdman
to run the Selenium server using wdman::selenium()
.
Instead of using selenium_client_options()
, you can use
rselenium_client_options()
to control the creation of an
RSelenium::remoteDriver()
object instead.
Note that the driver
option of selenider_session()
also accepts these
objects in place of their selenium equivalents.
An options object that can be passed into selenium_options()
.