Skip to content

Commit

Permalink
addition
Browse files Browse the repository at this point in the history
  • Loading branch information
nichoffs committed Aug 15, 2024
1 parent c542fb8 commit 8424206
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion content/post/mini_projects/grokking.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ color = "" #color from the theme settings

- **GPT** - Generatively Pre-trained Transformer - you should know how these work before reading.

# What is grokking (mechanstically)?
# What is grokking (mechanistically)?

Researchers in the field of mechanistic interpretability have adopted the term "grokking" to refer to a unique training behavior observed in neural networks. Grokking describes the phenomenon of "delayed generalization," where a model initially overfits the training data, reaching zero training loss, before eventually developing a more general solution to the task that significantly reduces test loss.

Expand Down

0 comments on commit 8424206

Please sign in to comment.