Natural Language Processing Challenge

Introduction

Processing text is a core skill for data scientists. In this project, you will put that skill into practice by identifying whether a sentence was translated by a machine or by a human.

Project Overview

In this repository you will find a dataset containing sentences in Spanish and their tags: 0 if the sentence was translated by a machine, 1 if the sentence was translated by a professional translator. Your goal is to build a classifier that can distinguish between the two.
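As a starting point, the tagged data has to be split into labels and sentences. The sketch below assumes each line holds a tag (0 or 1), a tab, and then the sentence. That layout is an assumption, not something this README states, so check the real file before relying on it. The function name `parse_tagged_lines` is hypothetical.

```python
from typing import Iterable, List, Tuple


def parse_tagged_lines(lines: Iterable[str]) -> Tuple[List[int], List[str]]:
    """Split lines of the ASSUMED form '<0|1><TAB><sentence>' into two lists."""
    labels: List[int] = []
    sentences: List[str] = []
    for line_number, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        tag, separator, sentence = line.partition("\t")
        if not separator or tag not in {"0", "1"}:
            raise ValueError(
                f"line {line_number}: expected '<0|1><TAB><sentence>'"
            )
        labels.append(int(tag))
        sentences.append(sentence.strip())
    return labels, sentences


# Possible usage (UTF-8 encoding is also an assumption):
# with open("TRAINING_DATA.txt", encoding="utf-8") as handle:
#     labels, sentences = parse_tagged_lines(handle)
```

Failing loudly on a malformed line is deliberate: a silent skip would hide a wrong guess about the file layout.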

Guidance

As in a real-life scenario, you are free to make your own choices about how to treat the text. Use the techniques you have learned and the common packages to process this data and classify the text.
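One possible text treatment, shown only as an illustration, is to normalize each sentence and count character n-grams. This is not a method the project prescribes. The sketch uses only the standard library, and the names `normalize` and `char_ngrams` are hypothetical. Accented characters are kept on purpose, since they carry information in Spanish.

```python
import re
from collections import Counter

_WHITESPACE = re.compile(r"\s+")


def normalize(text: str) -> str:
    """Lowercase the text and collapse runs of whitespace into single spaces."""
    return _WHITESPACE.sub(" ", text.strip().lower())


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams in the normalized text."""
    if n < 1:
        raise ValueError("n must be at least 1")
    cleaned = normalize(text)
    # Text shorter than n yields an empty range, hence an empty Counter.
    return Counter(cleaned[i:i + n] for i in range(len(cleaned) - n + 1))
```

The resulting counts can feed any classifier that accepts bag-of-features input. Word-level tokens, punctuation statistics, or sentence length are equally valid choices.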

Deliverables

Python Code: Provide well-documented Python code that conducts the analysis.
Accuracy estimation: Provide the teacher with your estimate of how your model will perform on unseen data.
Classified Dataset: On Friday, you will receive a dataset without tags. Prepare your code to be able to tag that dataset.
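For the accuracy estimate, one common approach is k-fold cross-validation: hold out part of the tagged data, train on the rest, and average the held-out accuracy. The sketch below is a standard-library illustration under stated assumptions, not the required method. `fit` is a hypothetical callable that takes training sentences and labels and returns a function that predicts a tag for one sentence.

```python
import random
from typing import Callable, List, Sequence, Tuple

Predictor = Callable[[str], int]


def k_fold_indices(n: int, k: int, seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Return k (train_indices, test_indices) pairs over n shuffled items."""
    if k < 2 or k > n:
        raise ValueError("k must satisfy 2 <= k <= n")
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for held_out in range(k):
        train = [j for f, fold in enumerate(folds) if f != held_out for j in fold]
        splits.append((train, folds[held_out]))
    return splits


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true tags."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cross_val_accuracy(
    sentences: Sequence[str],
    labels: Sequence[int],
    fit: Callable[[List[str], List[int]], Predictor],
    k: int = 5,
    seed: int = 0,
) -> float:
    """Average held-out accuracy across k folds."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(sentences), k, seed):
        predict = fit([sentences[i] for i in train_idx], [labels[i] for i in train_idx])
        predictions = [predict(sentences[i]) for i in test_idx]
        scores.append(accuracy([labels[i] for i in test_idx], predictions))
    return sum(scores) / len(scores)
```

The same `predict` function, trained on all tagged data, can then be applied to each sentence of the untagged dataset to produce its tags.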

Repository files

README.md
REAL_DATA.txt
TRAINING_DATA.txt

ironhack-labs/project-3-nlp
