Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doug/create data loader #2

Merged
merged 13 commits into from
Oct 17, 2022
Merged

Doug/create data loader #2

merged 13 commits into from
Oct 17, 2022

Conversation

dgjlindsay
Copy link
Contributor

No description provided.

@github-actions
Copy link

6f21d76

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

Copy link
Collaborator

@shelmigtwo shelmigtwo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @dgjlindsay! Just a few minor things mostly related to getting the CI to pass.
Also could you run black over these files as well?

DEFAULT_OBJECT_NAME = 'bs140513_032310.csv'

def load_data(object_name: str = None):
object_url = BUCKET_URL + (object_name or DEFAULT_OBJECT_NAME)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you add a docstring for this function, so that interrogate is happy? 🙂

@@ -0,0 +1,33 @@
from unittest.mock import patch, Mock

from util.data_access import load_data, DEFAULT_OBJECT_NAME, BUCKET_URL
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this may require a full path, so src.util.data_access, for the tests to pass in the GitHub action.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed the CI PYTHONPATH instead, so that we can import like this without the "src" prefix

@@ -0,0 +1,8 @@
import pandas
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very minor, but the convention for pandas usage is to import it as pd

@github-actions
Copy link

184b9e6

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

5c00658

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

7f0fcf7

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

801ea61

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

3b03a61

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

d8f488e

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

92dc9a6

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

dd562d6

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@github-actions
Copy link

8a00989

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

@dgjlindsay dgjlindsay merged commit 5f05004 into staging Oct 17, 2022
@dgjlindsay dgjlindsay deleted the doug/create_data_loader branch October 17, 2022 13:57
@github-actions
Copy link

5f05004

Model Type: DummyClassifier

Train Data Length: 445982
Test Data Length: 148661

f1_score - 0.000000
roc_auc - 0.500000
accuracy - 0.987919

CML watermark

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants