Skip to content
View AvaAvarai's full-sized avatar
☃️
☃️

Highlights

  • Pro

Organizations

@CWU-VKD-LAB

Block or report AvaAvarai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AvaAvarai/README.md

"I believe in intuition and inspiration. Imagination is more important than knowledge. For knowledge is limited, whereas imagination embraces the entire world, stimulating progress, giving birth to evolution. It is, strictly speaking, a real factor in scientific research.”
Albert Einstein 1929, quoted by Dr. Boris Kovalerchuk in Visual Knowledge Discovery and Machine Learning 2018.

Alice Williams

GitRoll Profile Badge

Double Bachelor's of Science senior student majoring in Computer Science and Applied Mathematics. Active research assistant at the Visual Knowledge Discovery and Imaging Lab of Central Washington University. Former system administrator, full-stack software developer, and web-developer. Next, graduate studies in Computational Science focusing on visual machine learning. With a goal of building trusted expert artificial intelligence systems.

Solving problems at the intersection of machine learning and data visualization by blending machine computation with human cognition. Computing with data visualizations directly, instead of just making visuals. While integrating visual approaches with conventional machine learning methods.

Oxford Dictionary, "Intelligence", noun, 1. the ability to acquire and apply knowledge and skills. Research goals: Aquire knowledge by visual knowledge discovery with multidimensional lossless visualizations, then apply aquired knowledge to build skills that solve difficult tasks more interpretably and accurately.

If you find my work of interest or benefit, then please consider a supporting gesture through "Buy Me A Coffee", thank you.

Research Interests

In no preferential order.

Research Interest Description
Visual Machine Learning Building machine learning models with visual representations for better interpretability.
Visual Knowledge Discovery Extracting patterns visually from multidimensional data to solve a task like machine learning.
Data Mining Extracting valuable information from large sample count and dimensionality of data.
Multidimensional Data Visualization Representing multidimensional data in representations advantageous for the task to solve.
Natural Language Processing Analyzing and generating human language for human-computer interaction or computation.
Automated Decision-Making Developing and analyzing automated decision-making systems for limited human interaction.
Human-Computer Interaction Designing effective human-computer interaction using visually interactive interfaces.

Academic Research

Chronologically ordered from current at top to first at bottom.

  1. "Synthetic Data Generation with Visual Knowledge Discovery"
  • Authors: Alice Williams and Dr. Boris Kovalerchuk.
  • Status: Actively researching at the CWU Visual Knowledge Discvery and Imaging Lab.
  • Contributions: Expanding on synthetic data generation methods with interactive algorithms paired with Generative Adversarial Networks (GANS). Expanding software with relabeling and GAN data synthesization, in support of theorized algorithms. Exploring GAN deficiencies and seeking ways to integrate visual improvements.
  • Topics: Synthetic Tabular Data Generation, Image to tabular data preprocessing, General Line Coordinates, Hyperblock Model Representation, Generative Adversarial Networks, Multi-Row Parallel Coordinates, Imagery Visualization
  • Developed software: Dynamic_Coordinates_Vis_System, ML_Classifier_Comparison_Tool, DCGAN_Custom_Architecture_Builder_and_Image_Synthesizer
  1. "Interpretable Computational Visual Knowledge Discovery and Boosting Classification Models with Human-in-the-Loop"
  • Authors: Alice Williams & Dr. Boris Kovalerchuk.
  • Status: Preparing camera ready version, accepted for presentation at AI Human Computer Interfaces 2025 in thematic area of Artificial Intelligence in HCI.
  • Contributions: Visual solution to the 2003 Gödel Prize problem of AdaBoost classifier by visual observation of options for different classifiers before selecting them for boosting allowing alternative ways to build stronger classifiers, including directly building classifiers in a lossless visualization space and visualizing classifier uncertainty, applying visual knowledge discovery methods to extract data representational rules and improve classification models by rebuilding them with interpretable hyperblock representations.
  • Topics: Human-Centered AI, Machine Learning, Multi-Class Classification, Feature Engineering, Explainable AI, AI Risk Mitigation, Human-in-the-Loop, Multi-Dimensional Distributions, General Line Coordinates, Concentric Coordinates.
  • Developed software: Java_Tabular_Vis_Toolkit, HyperblockParser, InLineCoordinatesCoefficientSolver
  1. "Synthetic Data Generation and Automated Multidimensional Data Labeling for AI/ML in General and Circular Coordinates"
  • Authors: Alice Williams & Dr. Boris Kovalerchuk.
  • Status: Published in IEEE proceedings and presented at IV2024 in track of Artificial Intelligence and Visual Knowledge Discovery, earned the 'Best Paper Award'.
  • Contributions: Proposed an algorithm and implemented interactive software for labeled synthetic data generation using former General Line Coordinate methods and newly developed Circular Coordinates both Static and Dynamic with multi-class visualization and parameterized class discrimination. Addressed data balancing, demonstrated deficiencies of popular SMOTE (Synthetic Minority Oversampling Technique), and showed improvements to classifier performance across 14 standard machine learning classifiers.
  • Topics: Synthetic Data Generation, Automated Data Labeling, General Line Coordinates, Circular Coordinates, Parallel Coordinates, Shifted Paired Coordinates, Tabular AI/ML Data, Multidimensional Data Visualization, Visual Knowledge Discovery.
  • Developed software: Dynamic_Coordinates_Vis_System

Technical Experiences

Role Organization Focus/Description
Startup Founder and Consultant (Active) AI Education Technology Stealth Startup LLCs Small business founding, startup product development, market research, and additionally, I provide AI and ML consulting.
Research Assistant (Active) CWU Visual Knowledge Discovery and Imaging Lab Researching Visual Knowledge Discovery and Machine Learning data classification, synthesization, and model interpretability.
Teaching Assistant CWU Computer Science and Mathematics Departments Assisted in Computer Architecture, Algorithm Analysis, Mathematical Computing, and as an undergraduate CS Tutor.
Web Developer Freelance Sole-Proprietor Business Supported individuals and small businesses in developing, updating, and maintaining web applications and websites.
Full-Stack Software Developer Contract Projects and Freelance Business Delivered data processing automation and business solutions for individuals, small businesses, and an enterprise contract.
Linux Game Server Administrator Game Server Distributor and Management LLCs Responsible for server sharding, setup, protection, updates, backup solutions, user registration, and software support.

Let's Connect

I'm always open to discussing new projects or opportunities. Feel free to reach out or connect with me!

Pinned Loading

  1. Dynamic_Coordinates_Vis_System Dynamic_Coordinates_Vis_System Public

    Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.

    Python 5 2

  2. Java_Tabular_Vis_Toolkit Java_Tabular_Vis_Toolkit Public

    A cross-platform tool for building machine learning models with General Line Coordinates lossless data visualizations, analyzing classifier errors, and improving classification with assistive compu…

    Java 2 1

  3. ML_Classifier_Comparison_Tool ML_Classifier_Comparison_Tool Public

    Machine Learning classifier comparison GUI application. Choose 21 classifiers, evaluation data (optional for evaluation of synthetic data), hyperparameters, cross-validation splits, and rng seed; t…

    Python 1 1

  4. Hyperblock_Parser Hyperblock_Parser Public

    Parses conjunctive normal form hyperblock notation to parallel coordinate graph visualizations. Hyperblocks are an interpretable way to build machine learning models.

    Python 1 1

  5. DCGAN_Custom_Architecture_Builder_and_Image_Synthesizer DCGAN_Custom_Architecture_Builder_and_Image_Synthesizer Public

    DCGAN (Deep Convolutional Generative Adversarial Network) custom architecture builder and image synthesizer to specify the architecture of the generator and discriminator, visualize the models, tra…

    Python 2

  6. Local_Small_LM_Document_RAG Local_Small_LM_Document_RAG Public

    Local semantic sentence embedding Reader-Answerer model for Retrieval Augmented Generation (RAG) of cited question answering from .pdf, .md, & .docx files using small language models.

    Python 4