This module provides browser automation built on Selenium WebDriver, using undetected_chromedriver to drive Chrome.

Overview

The automation module enables:

Undetected Browser Control: Uses undetected_chromedriver to avoid detection
Text-based Element Finding: Locate elements by visible text content
Visual Element Matching: Use Airtest for image-based element detection
Page State Monitoring: Wait for HTML and visual stabilization
Document Conversion: Convert Markdown with Ruby annotations to PNG/PDF

Core Components

Browser initialization and management
Element finding strategies (text-based and visual)
Page waiting and monitoring utilities
Screenshot and document generation tools

Installation Requirements

Install the required dependencies for browser automation:

pip install undetected_chromedriver webdriver_manager

# Commented-out initializer that uses plain Selenium with webdriver_manager.
# from selenium import webdriver
# from selenium.webdriver.chrome.service import Service
# from webdriver_manager.chrome import ChromeDriverManager

# def init(*arguments):
#     chrome_options = webdriver.ChromeOptions()
#     for argument in arguments:
#         if isinstance(argument, str):
#             chrome_options.add_argument(argument)
#     global driver
#     driver = webdriver.Chrome(
#         options=chrome_options,
#         service=Service(ChromeDriverManager().install())
#     )
#     global device_pixel_ratio
#     device_pixel_ratio = driver.execute_script('return window.devicePixelRatio;')

init('--user-data-dir=C:\\Users\\seii-saintway\\Downloads\\chrome-profile')
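The commented-out initializer above relies on plain Selenium. A comparable initializer built on undetected_chromedriver might look like the sketch below. The names `collect_arguments` and `init_undetected` are hypothetical and the module's actual `init` may differ; returning the driver instead of setting globals is a choice made here for illustration.

```python
def collect_arguments(*arguments):
    """Keep only string arguments, mirroring the isinstance check in the commented-out init."""
    return [argument for argument in arguments if isinstance(argument, str)]


def init_undetected(*arguments):
    """Start Chrome through undetected_chromedriver (hypothetical helper)."""
    # Imported lazily so this file can be loaded without the package installed.
    import undetected_chromedriver as uc

    options = uc.ChromeOptions()
    for argument in collect_arguments(*arguments):
        options.add_argument(argument)
    driver = uc.Chrome(options=options)
    # Same query as the commented-out init; the ratio helps map screenshot
    # pixels back to CSS pixels on high-DPI displays.
    device_pixel_ratio = driver.execute_script('return window.devicePixelRatio;')
    return driver, device_pixel_ratio
```

Calling `init_undetected('--lang=en')` would open a browser window, so it needs a local Chrome installation.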

Logging Configuration

Set up logging for the automation module to track operations and debug issues.
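A minimal sketch of such a setup with the standard logging package follows. The logger name "automation" and the message format are assumptions, not taken from the module.

```python
import logging


def configure_logging(name="automation", level=logging.INFO):
    """Return a logger that writes timestamped records to stderr."""
    logger = logging.getLogger(name)
    logger.setLevel(level)
    # Attach the handler only once so repeated calls do not duplicate output.
    if not logger.handlers:
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(name)s %(levelname)s: %(message)s")
        )
        logger.addHandler(handler)
    return logger


logger = configure_logging()
logger.info("browser automation logging configured")
```

Passing `level=logging.DEBUG` would surface more detail when tracing element searches.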

Text-based Element Detection

This section implements element finding based on visible text content. The search runs through different DOM scopes and supports both exact and partial text matching.
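One way to express exact and partial text matching is through XPath, as in the sketch below. The helper names `xpath_literal`, `text_xpath`, and `find_elements_by_text` are hypothetical; the module's own `find_elements` may use a different strategy. The `scope` argument is assumed to be a Selenium driver or element, both of which expose `find_elements(by, value)`.

```python
def xpath_literal(text):
    """Quote text as an XPath 1.0 string literal, handling embedded quotes."""
    if "'" not in text:
        return f"'{text}'"
    if '"' not in text:
        return f'"{text}"'
    # Both quote types present: join single-quote-free pieces with concat().
    parts = text.split("'")
    return "concat(" + ", \"'\", ".join(f"'{part}'" for part in parts) + ")"


def text_xpath(text, exact=True):
    """Build a relative XPath matching elements that own a matching text node."""
    literal = xpath_literal(text)
    if exact:
        return f".//*[text()[normalize-space(.)={literal}]]"
    return f".//*[text()[contains(normalize-space(.), {literal})]]"


def find_elements_by_text(scope, text, exact=True):
    """Search below scope; "xpath" is the value of Selenium's By.XPATH."""
    return scope.find_elements("xpath", text_xpath(text, exact))
```

The leading `.//` keeps the search relative to the scope, so the same expression works for the whole page or for a single container element.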

Text-based Browser Automation

Implement browser automation using text-based element detection with enhanced scope searching.
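The source does not spell out how scope searching works. The sketch below shows one plausible reading: try a list of scopes in order (for example a narrow container first, then the whole page) and act on the first match. The helpers `search_scopes` and `click_first` and the ordering of scopes are assumptions for illustration only.

```python
def search_scopes(scopes, xpaths):
    """Try every XPath in every scope, in order, and return the first non-empty result."""
    for scope in scopes:
        for xpath in xpaths:
            # "xpath" is the value of Selenium's By.XPATH.
            elements = scope.find_elements("xpath", xpath)
            if elements:
                return elements
    return []


def click_first(scopes, xpaths):
    """Click the first element found; report whether anything was clicked."""
    elements = search_scopes(scopes, xpaths)
    if not elements:
        return False
    elements[0].click()
    return True
```

Listing an exact-text XPath before a partial-text one in `xpaths` makes the stricter match win whenever both would succeed in the same scope.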

Page State Monitoring

These utilities help ensure page stability before taking actions or screenshots by monitoring HTML content and visual changes.
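A minimal sketch of HTML stabilization, assuming a page counts as stable once two consecutive polls of `driver.page_source` hash to the same value. The function names, the SHA-256 choice, and the default interval and timeout are assumptions; the module's `get_html_hash` and `wait` may behave differently.

```python
import hashlib
import time


def html_hash(html):
    """Return a hex digest that changes whenever the HTML text changes."""
    return hashlib.sha256(html.encode("utf-8")).hexdigest()


def wait_for_stable_html(driver, interval=0.5, timeout=10.0):
    """Poll the page source until two consecutive hashes match.

    Returns True once the page is stable, False if the timeout expires first.
    """
    deadline = time.monotonic() + timeout
    previous = html_hash(driver.page_source)
    while time.monotonic() < deadline:
        time.sleep(interval)
        current = html_hash(driver.page_source)
        if current == previous:
            return True
        previous = current
    return False
```

Comparing hashes rather than full strings keeps only a short digest in memory between polls, which matters little for small pages but is convenient for logging.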

Static Page Inspection

Use Selenium for static inspection of page appearance and content monitoring.
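Page appearance can be fingerprinted in a similar way from a screenshot. The sketch below hashes the PNG bytes returned by Selenium's `get_screenshot_as_png()`; the helper names are hypothetical and not necessarily how the module's `screen_hash` is written.

```python
import hashlib


def screen_hash(driver):
    """Hash the current viewport screenshot (PNG bytes) taken by Selenium."""
    return hashlib.sha256(driver.get_screenshot_as_png()).hexdigest()


def appearance_changed(driver, reference_hash):
    """Report whether the viewport differs from an earlier screen_hash() value."""
    return screen_hash(driver) != reference_hash
```

A byte-level comparison is strict: a blinking cursor or a running animation is enough to change the hash, so pages with such elements may never appear visually stable under this check.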

Visual Element Detection with Airtest

This section integrates Airtest for image-based element detection, useful when text-based approaches are insufficient.

Airtest Integration

Use Airtest for advanced browser automation with image-based element detection.
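As a rough sketch of image-based detection, the code below matches a template image against a Selenium screenshot with Airtest's `Template.match_in` and converts the hit from screenshot pixels to CSS pixels using the device pixel ratio. The helper names, the 0.8 threshold, and the decoding through OpenCV are assumptions; the module's `find_position` may be implemented differently.

```python
def to_css_pixels(position, device_pixel_ratio):
    """Convert screenshot pixel coordinates to CSS pixel coordinates."""
    x, y = position
    return x / device_pixel_ratio, y / device_pixel_ratio


def find_template(driver, template_path, device_pixel_ratio=1.0, threshold=0.8):
    """Return the CSS-pixel center of the template on screen, or None if absent."""
    # Imported lazily: these packages are only needed for visual matching.
    import cv2
    import numpy as np
    from airtest.core.cv import Template

    png = driver.get_screenshot_as_png()
    screen = cv2.imdecode(np.frombuffer(png, dtype=np.uint8), cv2.IMREAD_COLOR)
    position = Template(template_path, threshold=threshold).match_in(screen)
    if position is None:
        return None
    return to_css_pixels(position, device_pixel_ratio)
```

The division matters on high-DPI displays, where a screenshot can be larger than the viewport measured in CSS pixels.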

Document Conversion Tools

These tools convert Markdown files with Ruby annotations (for Japanese text) into PNG images and PDF documents using browser rendering.

pip install airtest

Document Conversion

Use Selenium to convert Markdown files with Ruby annotations to PNG and PDF formats.

pip install markdown
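The source does not show how Ruby annotations are written in the Markdown files. The sketch below assumes a hypothetical `{base|reading}` notation and rewrites it into HTML `<ruby>` markup before handing the text to the markdown package. The notation, the helper names, and the page wrapper are all assumptions.

```python
import html
import re

# Hypothetical notation: {漢字|かんじ} marks base text and its reading.
RUBY_PATTERN = re.compile(r"\{([^{}|]+)\|([^{}|]+)\}")


def annotate_ruby(text):
    """Replace {base|reading} spans with <ruby>base<rt>reading</rt></ruby>."""
    def replace(match):
        base, reading = match.group(1), match.group(2)
        return f"<ruby>{html.escape(base)}<rt>{html.escape(reading)}</rt></ruby>"
    return RUBY_PATTERN.sub(replace, text)


def markdown_with_ruby_to_html(text):
    """Render annotated Markdown to a standalone HTML page."""
    # Imported lazily so annotate_ruby works without the package installed.
    import markdown

    body = markdown.markdown(annotate_ruby(text))
    return (
        '<!DOCTYPE html>\n<html lang="ja">\n'
        '<head><meta charset="utf-8"></head>\n'
        f"<body>\n{body}\n</body>\n</html>\n"
    )
```

Text without the assumed notation passes through `annotate_ruby` unchanged, so ordinary Markdown files are unaffected.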

init('--lang=en')

convert_md_with_ruby_to_png('2024-11-21.md')

convert_md_with_ruby_to_pdf('2024-11-21.md')
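For the rendering step, a browser can load the generated HTML file and export it. The sketch below is not the module's implementation: it captures the PNG with Selenium's `save_screenshot` and produces the PDF through the Chrome DevTools command `Page.printToPDF`, rather than through a print dialog. It assumes a Chromium-based driver that exposes `execute_cdp_cmd`, and the helper names are hypothetical.

```python
import base64
from pathlib import Path


def html_to_png(driver, html_path, png_path):
    """Load a local HTML file and save a screenshot of the visible viewport."""
    driver.get(Path(html_path).resolve().as_uri())
    # save_screenshot returns True on success and False on an I/O error.
    return driver.save_screenshot(str(png_path))


def html_to_pdf(driver, html_path, pdf_path):
    """Load a local HTML file and write the PDF returned by Page.printToPDF."""
    driver.get(Path(html_path).resolve().as_uri())
    result = driver.execute_cdp_cmd("Page.printToPDF", {"printBackground": True})
    # The DevTools protocol returns the document as base64 text under "data".
    output = Path(pdf_path)
    output.write_bytes(base64.b64decode(result["data"]))
    return output
```

Because `save_screenshot` only covers the viewport, long documents would need the window resized to the page height first, or a different capture method.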

Browser Automation Module

Overview

Core Components

Installation Requirements

init

quit

Logging Configuration

Text-based Element Detection

Text-based Browser Automation

ok

last

new

close

find_elements

find_elements

find_element

Page State Monitoring

click

input

Static Page Inspection

get_html_hash

wait

Visual Element Detection with Airtest

screen_hash

watch

Airtest Integration

Document Conversion Tools

try_log_screen

find_position

inject

get_mouse_position

move_to_center

move_and_click

exists

touch

fill

Document Conversion

convert_md_with_ruby_to_html

convert_md_content_to_html

save_file

convert_html_with_ruby_to_png

convert_md_with_ruby_to_png

dialog_for_printing

convert_html_with_ruby_to_pdf

convert_md_with_ruby_to_pdf
