  • philadelphia

Showing results

Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]

Python 29 3 Updated May 28, 2024

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,438 124 Updated Feb 6, 2025

Stanford NLP Python library for benchmarking the utility of LLM interpretability methods

Python 58 4 Updated Feb 23, 2025

Fast Forward the finetuning of LLMs

Python 2 1 Updated Sep 25, 2024

Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the paper "Evaluating Open-Source Sparse Autoencoders on Disentan…

Python 11 1 Updated Jan 26, 2025

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 710 80 Updated Feb 24, 2025

How do transformers model physics? Just like we do!

Python 7 1 Updated May 28, 2024
JavaScript 3,005 400 Updated Mar 8, 2025
Jupyter Notebook 1 1 Updated Oct 22, 2024

A reading list for papers on causality for natural language processing (NLP)

618 65 Updated Mar 3, 2025

Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.

Python 36 2 Updated Nov 3, 2024

General-purpose activation steering library

Python 49 6 Updated Jan 3, 2025

Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition) and data enrichment (annotation) pipe…

Python 266 73 Updated Oct 9, 2022

Clustered SAE Steering Code and Experiments

Python 1 Updated Oct 15, 2024

A curated list of awesome Category Theory resources.

112 5 Updated Feb 24, 2024
Python 1 Updated Oct 18, 2024

Real-time face swap and one-click video deepfake with only a single image

Python 44,542 6,564 Updated Mar 6, 2025

A collection of (mostly) technical things every software developer should know about

86,530 7,958 Updated Aug 6, 2024

A resource repository for representation engineering in large language models

109 5 Updated Nov 14, 2024

The AI to keep you focused 😈

Python 354 40 Updated Feb 20, 2025
Python 64 13 Updated Jul 16, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to it.

Python 11,920 1,057 Updated Mar 6, 2025

Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"

Python 547 45 Updated Jun 28, 2024

Steering vectors for transformer language models in PyTorch / Hugging Face

Python 90 8 Updated Feb 21, 2025

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).

Jupyter Notebook 213 23 Updated Mar 9, 2025

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Python 1 Updated Jun 13, 2024
Python 504 34 Updated Jul 29, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,486 1,458 Updated Jan 19, 2025

Tools for understanding how transformer predictions are built layer-by-layer

Python 478 50 Updated Jun 2, 2024

Tools for studying developmental interpretability in neural networks.

Python 86 16 Updated Jan 24, 2025