This module provides Japanese natural language processing capabilities using PyKNP (Python interface for JUMAN++ and KNP).

Overview

The NLP module enables:

Morphological Analysis: Break down Japanese text into morphemes using JUMAN++
Ruby Annotation: Automatically add furigana (reading aids) to kanji characters
Text Formatting: Convert between halfwidth and fullwidth characters
HTML Integration: Process HTML content and add ruby annotations to Japanese text
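As an illustration of the text formatting item above, here is a minimal sketch of halfwidth/fullwidth conversion for the printable ASCII range. The function names `to_fullwidth` and `to_halfwidth` are hypothetical and are not taken from this module. The sketch relies only on the fixed Unicode offset between ASCII and the fullwidth forms block, and it leaves katakana untouched.

```python
# Printable ASCII (U+0021..U+007E) maps to the fullwidth forms
# (U+FF01..U+FF5E) by a constant offset of 0xFEE0.
# The space character is a special case: U+0020 <-> U+3000.
_TO_FULL = {code: code + 0xFEE0 for code in range(0x21, 0x7F)}
_TO_FULL[0x20] = 0x3000

# Inverse table for the opposite direction.
_TO_HALF = {full: half for half, full in _TO_FULL.items()}


def to_fullwidth(text):
    """Convert halfwidth ASCII characters to their fullwidth forms."""
    return text.translate(_TO_FULL)


def to_halfwidth(text):
    """Convert fullwidth ASCII-range characters back to halfwidth."""
    return text.translate(_TO_HALF)
```

Characters outside the ASCII range, such as kana and kanji, pass through both functions unchanged, so the two calls round-trip on mixed Japanese and Latin text.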

Key Features

Install PyKNP and JUMAN++ for Japanese morphological analysis:

Install JUMAN++ first, following its official installation instructions, then install PyKNP:

pip install pyknp

These functions handle Japanese text analysis and ruby annotation generation using the JUMAN++ morphological analyzer.
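The analysis and annotation steps might be sketched as follows. This is not the module's own code: `analyze` and `to_ruby` are hypothetical names. The sketch assumes PyKNP's `Juman` class with its `analysis()` method, and the `midasi` (surface form) and `yomi` (reading) attributes of each morpheme. It also assumes the `jumanpp` binary is on the PATH. Ruby is placed over a whole morpheme, okurigana included, which is a simplification.

```python
import html
import re

# CJK Unified Ideographs plus the iteration mark (U+3005).
KANJI = re.compile(r"[\u4e00-\u9fff\u3005]")


def analyze(text):
    """Return (surface, reading) pairs for each morpheme in text."""
    # Imported lazily so the rest of this sketch works without PyKNP installed.
    from pyknp import Juman

    result = Juman().analysis(text)
    return [(m.midasi, m.yomi) for m in result.mrph_list()]


def to_ruby(pairs):
    """Build an HTML string, wrapping kanji-bearing morphemes in <ruby>."""
    parts = []
    for surface, reading in pairs:
        if KANJI.search(surface) and reading and reading != surface:
            parts.append(
                f"<ruby>{html.escape(surface)}"
                f"<rt>{html.escape(reading)}</rt></ruby>"
            )
        else:
            # Kana, punctuation and Latin text need no reading aid.
            parts.append(html.escape(surface))
    return "".join(parts)
```

With JUMAN++ installed, `to_ruby(analyze(text))` would chain the two steps. Keeping `to_ruby` independent of the analyzer makes it possible to exercise the markup logic with hand-written pairs.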

This function processes HTML content and adds ruby annotations to Japanese text within HTML elements.

Additional dependencies for HTML content processing:

pip install beautifulsoup4
pip install lxml
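One way to apply ruby annotation to HTML content is sketched below, using Beautiful Soup. It is an assumption about the general approach, not the module's implementation. `add_ruby_to_html`, `has_kanji`, and the `annotate` callback are hypothetical names; `annotate` is expected to take plain text and return an HTML string with `<ruby>` markup already applied. Only the direct parent of each text node is checked against the skip list.

```python
import re

KANJI = re.compile(r"[\u4e00-\u9fff\u3005]")

# Text inside these elements is left alone (checked on the direct parent only).
SKIP_TAGS = {"script", "style", "ruby", "rb", "rt", "rp"}


def has_kanji(text):
    """Return True if text contains at least one kanji character."""
    return bool(KANJI.search(text))


def add_ruby_to_html(html_text, annotate, parser="lxml"):
    """Replace kanji-bearing text nodes with ruby-annotated markup."""
    # Imported lazily so has_kanji stays usable without Beautiful Soup.
    from bs4 import BeautifulSoup, NavigableString

    soup = BeautifulSoup(html_text, parser)
    # Snapshot the text nodes first, because the tree is modified in the loop.
    for node in list(soup.find_all(string=True)):
        # Skip comments, doctypes and other NavigableString subclasses.
        if type(node) is not NavigableString:
            continue
        if node.parent.name in SKIP_TAGS or not has_kanji(node):
            continue
        # Parse the annotated text and splice its nodes in place of the original.
        fragment = BeautifulSoup(annotate(str(node)), "html.parser")
        for child in list(fragment.contents):
            node.insert_before(child)
        node.extract()
    return str(soup)
```

Note that the `lxml` parser wraps fragments in `<html>` and `<body>` elements, so passing `parser="html.parser"` may be preferable when the input is a fragment rather than a full page.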