Notes on interesting papers I've read and everything else I find noteworthy, mostly for future reference.

- [Computational Linguistics 1996] A maximum entropy approach to natural language processing ml paper nlp
- Monte Carlo Tree Search rl
- [arXiv 2022] HyperTree Proof Search for Neural Theorem Proving rl ml paper theorem-proving
- [arXiv 2021] Proving Theorems using Incremental Learning and Hindsight Experience Replay ml paper theorem-proving
- [arXiv 2022] Autoformalization with Large Language Models language-models paper theorem-proving
- [ICML 2022] Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt ml paper
- [ICLR 2022] An Explanation of In-context Learning as Implicit Bayesian Inference paper ml bayesian-inference in-context-learning
- [arXiv 2022] Data Distributional Properties Drive Emergent In-Context Learning in Transformers paper ml in-context-learning
- [ICML 2009] Curriculum learning ml paper
- GTD
- [EDM 2020] Variational Item Response Theory: Fast, Accurate, and Expressive ml paper psychometrics irt
- [ICML 2019] Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations ml paper
- [Journal of Mathematical Psychology 2007] ‘Ideal learning’ of natural language: Positive results about learning from positive evidence linguistics paper
- [PNAS 2022] One model for the learning of language linguistics paper cogsci
- [ICML 2021] LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning ml paper theorem-proving
- [NeurIPS 2020] Big Bird: Transformers for Longer Sequences paper
- [arXiv] Longformer: The Long-Document Transformer paper
- [ICLR 2022] Memorizing Transformers paper
- [ICLR 2022] Proof Artifact Co-training for Theorem Proving with Language Models ml paper theorem-proving
- GNU/Linux Reference linux
- [NeurIPS 2018] How Does Batch Normalization Help Optimization? ml paper
- [NeurIPS 2019] Adversarial Examples Are Not Bugs, They Are Features ml robustness paper
- [ICLR 2022] Natural Language Descriptions of Deep Visual Features ml paper interpretability
- [ICCV 2021] Curious Representation Learning for Embodied Intelligence rl ml paper representation-learning
- [SocArxiv 2021] The InterModel Vigorish (IMV): A flexible and portable approach for quantifying predictive accuracy with binary outcomes paper psychometrics stats
- [ICLR 2022] Increasing the Cost of Model Extraction with Calibrated Proof of Work ml security paper
- Energy-Based Models ml generative-models
- [ICML 2017] Wasserstein Generative Adversarial Networks ml generative-models paper
- [NeurIPS 2016] f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization ml generative-models paper
- [NeurIPS 2014] Generative Adversarial Nets paper
- [Cognitive Science 2020] Assessing Mathematics Misunderstandings via Bayesian Inverse Planning paper cogsci education bayesian-inference
- [ICLR 2015] Importance Weighted Autoencoders ml paper
- [ICLR 2017] beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework ml paper
- Evidence Lower Bound (ELBO) stats ml
- [ICLR 2014] Auto-Encoding Variational Bayes ml generative-models paper
- Experiment Pre-registration research
- Fixing the resolution of an external display arch-linux
- [ICLR 2020] Plug and Play Language Models: A Simple Approach to Controlled Text Generation paper
- [ICLR 2021] IsarStep: A Benchmark For High-Level Mathematical Reasoning paper
- [ACL 2016] Deep Reinforcement Learning with a Natural Language Action Space paper
- [Educational Psychology 2011] Practicing Versus Inventing With Contrasting Cases: The Effects of Telling First on Learning and Transfer paper
- [Science 2018] A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play paper
- [arXiv] Generative Language Modeling for Automated Theorem Proving paper
- [ICLR 2014] Auto-Encoding Variatonal Bayes paper
- [ICLR 2017] Categorical Reparameterization with Gumbel-Softmax paper
- [EMNLP 2020] Unsupervised Question Decomposition for Question Answering paper
- [NeurIPS 2019] Representation Learning with Contrastive Predictive Coding paper
- [Psychological Review 2007] Word Learning as Bayesian Inference paper
- [ICFP 2011] Parsing with Derivatives: A Functional Pearl paper
- [Cognition 2012] The communicative function of ambiguity in language paper
- [KDD 2016] "Why Should I Trust You?": Explaining the Predictions of Any Classifier paper
- [CHI 2020] An Interaction Design for Machine Teaching to Develop AI Tutors paper
- [TCS 2020] Anchoring Utterances paper
- [arXiv 2020] Program Synthesis with Pragmatic Communication paper
- [NeurIPS 2019] Write, Execute, Assess: Program Synthesis with a REPL paper
- [ICLR 2019] Execution-Guided Neural Program Synthesis paper
- [NeurIPS 2018] Learning to Infer Graphics Programs from Hand-Drawn Images paper
- [CVPR 2016] Neural Module Networks paper
- [BRMIC 2004] AutoTutor: A tutor with dialogue in natural language paper
- [CVPR 2020] ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks paper
- [arXiv 2020] Language Models are Few-Shot Learners paper
- [POPL 2004] Abstractions from proofs paper
- [ICCV 2017] Inferring and Executing Programs for Visual Reasoning paper
- [NeurIPS 2015] Exploring Models and Data for Image Question Answering paper
- [NeurIPS 2017] Attention is All You Need paper
- [ACL 2020] Interactive Classification by Asking Informative Questions paper
- [CogSci 2019] Asking goal-oriented questions and learning from answers paper
- [NeurIPS 2019] Zero-shot Learning via Simultaneous Generating and Learning paper