Scan2Text

OCRを使用して画像から文書のレイアウトを認識し、編集可能な形式に変換するWebアプリケーション。

機能

画像アップロード
OCRによるテキスト認識
レイアウト解析と表示
テキストとレイアウトの編集
結果の保存と管理

必要条件

Python 3.8以上
Tesseract OCR
Flask
その他の依存パッケージ（requirements.txtを参照）

インストール

リポジトリのクローン:

git clone https://github.com/mrexcellency/scan2text.git
cd scan2text

仮想環境の作成と有効化:

python -m venv venv
source venv/bin/activate  # Linux/Mac
venv\Scripts\activate  # Windows

依存パッケージのインストール:

pip install -r requirements.txt

Tesseract OCRのインストール:

Windows: https://github.com/UB-Mannheim/tesseract/wiki
Linux: sudo apt-get install tesseract-ocr
Mac: brew install tesseract

使用方法

データベースの初期化:

python init_db.py

アプリケーションの起動:

python app.py

ブラウザで以下のURLにアクセス:

http://localhost:8000

開発環境のセットアップ

テスト実行:

python -m unittest -v test_app.py

デバッグモードでの実行:

export FLASK_ENV=development  # Linux/Mac
set FLASK_ENV=development    # Windows
python app.py

ライセンス

MIT License - 詳細はLICENSEファイルを参照してください。

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
samples		samples
screen_shots		screen_shots
static		static
templates		templates
.bash_profile		.bash_profile
.bashrc		.bashrc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
init_db.py		init_db.py
requirements.txt		requirements.txt
test_app.py		test_app.py
test_ocr.py		test_ocr.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scan2Text

機能

必要条件

インストール

使用方法

開発環境のセットアップ

ライセンス

About

Releases

Packages

Languages

License

mrexcellency/scan2print

Folders and files

Latest commit

History

Repository files navigation

Scan2Text

機能

必要条件

インストール

使用方法

開発環境のセットアップ

ライセンス

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages