Skip to content

Files

Latest commit

0f020fe · Jun 7, 2023

History

History
45 lines (34 loc) · 1.45 KB

README.md

File metadata and controls

45 lines (34 loc) · 1.45 KB

KoGrammar

Korean Grammar Correction Model based on LLM

A Project for Introduction to Text Processing(LIS3813)

This project is ongoing.

Model

How To Use

  • Requirements

    torch
    transformers
    
  • Inference

    from transformers import BartConfig
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
    from transformers import pipeline
    
    checkpoint = 'theSOL1/kogrammar-base'
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    config = BartConfig.from_pretrained(checkpoint)
    model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint, config=config, device_map='auto')
    pipe = pipeline('text2text-generation', model=model, tokenizer=tokenizer)
    
    sample_text = 'ㄴㅏ는 ㄱㅏ끔 눈물을흘린다'
    corrected_text = pipe(sample_text)
    print(corrected_text)

Docs