Project Name

Welcome to the Japanese Lawyer using LLM. You can asy any japanese legal question and it'll answer correctly.

Introduction

This LLM Apllication was build to answer any legal Japanese question. After openning the application you can ask any legal japanese question the model will provide an answer.

You can change the model's knowledgebase and turn it into something different (teacher, doctor etc) by providing relevant text into a file and running "create_vector_database.py". You to change some paths in 'config.py' for it to work but after that you can ask any questions from your knowledgebase and the model will be able to answer it.

Installation

To get started, you need to set up the Conda environment.

Step 1: Install Conda

If you haven't already, install Conda from the official Anaconda website and follow the installation instructions.

Step 2: Create the Conda environment

Once Conda is installed, create a new environment named llm_module using the provided .yml file and activate that environment:

conda env create -f environment.yml
conda activate llm_module

Create Knowlegebase

To create the knowlegde base (In this case the knowledgebase is the entire legal text of Japan) you need to put all of the txt file in "Legal Text" Directory. You can change the txt directory and vectordatabase directory in the "config.py"

dir_path = '/media/nsl3090-3/hdd1/hujaifa/JP_NSL_Lawyer/Legal Text'
dataset_dir = '/media/nsl3090-3/hdd1/hujaifa/JP_NSL_Lawyer/Vector Database/db_jp_law'

then run "create_vector_database.py"

python create_vector_database.py

After running this script the vector database will be saved in "dataset_dir"

Hyperparameters

Before running the app, you can change the vector chunk size, vector overlap and vector seperator. You can also change the base model and embed model. To do these changes you have to edit the 'config.py' file

vector_chunk_size=512
vector_overlap = 20
vector_separator = '\n\n\n'

embed_model_path = 'intfloat/multilingual-e5-large'
llm_model_path = 'elyza/ELYZA-japanese-Llama-2-7b-fast-instruct'
generation_max_len = 512
temperature = 0

Inference

To run the app you just hape to run the following command,

python app.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Demo		Demo
Legal Text		Legal Text
static		static
templates		templates
app.py		app.py
config.py		config.py
create_vecotor_database.py		create_vecotor_database.py
environment.yml		environment.yml
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Name

Table of Contents

Introduction

Installation

Step 1: Install Conda

Step 2: Create the Conda environment

Create Knowlegebase

Hyperparameters

Inference

Demo

About

Releases

Packages

Languages

Hujaifa-Git/Japanese-Lawyer-using-Langchain-and-LLM

Folders and files

Latest commit

History

Repository files navigation

Project Name

Table of Contents

Introduction

Installation

Step 1: Install Conda

Step 2: Create the Conda environment

Create Knowlegebase

Hyperparameters

Inference

Demo

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages