Skip to content

Latest commit

 

History

History
83 lines (52 loc) · 4.35 KB

README.md

File metadata and controls

83 lines (52 loc) · 4.35 KB

Typing SVG

Docs License Python NumPy

⏳ AutonFeat ⌛

A high performance library for time series featurization.

Library

Resources 📚

What? 🙋

AutonFeat is a high-performant domain agnostic package for time series featurization. Despite the domain agnostic focus of the package, we recognize the benefit of domain knowledge and have included a few domain specific featurizers for popular domains like healthcare. With time series data, as with any data, it is often helpful to perform preprocessing before extracting information from it such as exploring the frequency domain as well as the time domain. We have provided a number of preprocessors that can transform the distribution or space to a form more amenable to certain featurizations. The package is lightweight, fast and easy to use. We hope you enjoy it! 🎉

Here's an illustration of what featurization looks like:

AutonFeat

Why? 🤔

To prevent others from reinventing the wheel, we have compiled a featurization library for dealing with time-series data. We have also included a number of preprocessors to transform the data into a form more amenable to certain featurizations. Finally, our goal was to make this package without too many dependencies and overhead.

AutonFeat provides a number of advantages over other packages:

  • Simple: The package must be easy to use and require as little user input as possible.
  • Interpretable: The software abstractions must be intuitive, easy to understand and easy to debug.
  • Fast: The tool must be fast enough to be used in large scale production environments.
  • Flexible: The package must be modular and allow for easy extensibility to leverage community contributions.

Assumptions 🧐

Note: We have made a few assumptions to start out with but we are working on making the package more flexible and robust. If you have any suggestions, please open an issue or PR! 🙂

  • The input data is a 1D time series in the form of a numpy array.
  • If there are missing values, they must be represented by np.nan to be detected, otherwise, gaps in the time series are not detected.

Installation 📦

pip install autonfeat

Installing inside a python virtual environment or a conda environment is recommended.

Features 🧠

We provide a variety of features ranging from domain agnostic to domain specific (e.g. healthcare) featurizers, as well as a number of preprocessors to transform the data into a form more amenable to certain featurizations. This list is constantly growing so please check back often! Feel free to contribute your own featurizers and open a PR! 🎉

Contributing 🤝

We'd love to hear from you! If you've found anything missing, feel free to open an issue or PR! 🙂

Authors 👨‍💻

Dhruv Srikanth

Auton Lab

License 📝

License

For more details, check out the license here.

If you enjoy using AutonFeat, please consider starring the repository ⭐️.