Hamish Ivison

Hi, I’m Hamish! I’m (currently) a PhD student at the University of Washington at H2Lab, advised by Hannaneh Hajishirzi. I’m generally interested in NLP research, with a focus on post-training for language models. I’m interested in making language models more usable for more people, and exploring ways to improve them that go beyond next-token training. Additionally, I’m also interested in improving and exploring language model data mixtures, and have dabbled in exploring alternatives approaches to language modelling.

I’m from Sydney and did my undergraduate at the University of Sydney, doing a Bachelor of Arts and IT and triple majoring in Linguistics, Classical Greek, and Computer Science. I also did some NLP with the UsydNLP group, examining multi-hop question answering. Throughout my undergrad (and just after), I spent some time at the Commonwealth Bank of Australia, start-up-y stuff, and Optiver. Before my PhD, I was a predoctoral researcher at AI2 on the AllenNLP team.

If you have questions about my work, general academia/software/research-related stuff, or want to chat, feel free to reach out at hamishiv [at] cs [dot] washington [dot] edu. I am generally happy to answer most questions! You can also find me on various social media at @hamishivi.

Papers

See below for papers I’ve worked on. You can also check out my Semantic Scholar and Google Scholar profiles.

In addition to these, I also help maintain Open-Instruct, a codebase for general LM post-training. Send me a note if you need some help with it!

Generalizing Verifiable Instruction FollowingValentina Pyatkin, Saumya Malik, Victoria Graf, Hamish Ivison, Shengyi Huang, Pradeep Dasigi, Nathan Lambert, and Hannaneh Hajishirzi. 2025.

@article{pyatkin2025generalizing,
  title = {{Generalizing Verifiable Instruction Following}},
  author = {Pyatkin, Valentina and Malik, Saumya and Graf, Victoria and Hamish Ivison and Huang, Shengyi and Dasigi, Pradeep and Lambert, Nathan and Hajishirzi, Hannaneh},
  year = {2025},
  url = {https://arxiv.org/abs/2507.02833},
  code = {https://github.com/allenai/IFBench}
}

The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong GainsScott Geng, Hamish Ivison, Chun-Liang Li, Maarten Sap, Jerry Li, Ranjay Krishna, and Pang Wei Koh. 2025. COLM.

@article{geng2025delta,
  title = {{The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains}},
  author = {Geng, Scott and Hamish Ivison and Li, Chun-Liang and Sap, Maarten and Li, Jerry and Krishna, Ranjay and Koh, Pang Wei},
  year = {2025},
  journal = {COLM},
  url = {https://arxiv.org/abs/2507.06187}
}

Large-Scale Data Selection for Instruction TuningHamish Ivison, Muru Zhang, Faeze Brahman, Pang Wei Koh, and Pradeep Dasigi. 2025.

@article{ivisondata2025,
  title = {{Large-Scale Data Selection for Instruction Tuning}},
  author = {Hamish Ivison and Zhang, Muru and Brahman, Faeze and Koh, Pang Wei and Dasigi, Pradeep},
  year = {2025},
  eprint = {2503.01807},
  archiveprefix = {arXiv},
  primaryclass = {cs.CL},
  url = {https://arxiv.org/abs/2503.01807},
  code = {https://github.com/hamishivi/automated-instruction-selection}
}

TESS 2: A Large-Scale Generalist Diffusion Language ModelJaesung Tae*, Hamish Ivison*, Sachin Kumar, and Arman Cohan. 2025. ACL.

@article{taeivison2025tess2,
  title = {{TESS 2: A Large-Scale Generalist Diffusion Language Model}},
  author = {Tae*, Jaesung and Hamish Ivison* and Kumar, Sachin and Cohan, Arman},
  year = {2025},
  journal = {ACL},
  url = {https://arxiv.org/abs/2502.13917},
  code = {https://github.com/hamishivi/tess-2}
}

2 OLMo 2 FuriousTeam OLMo (inc. Hamish Ivison). 2024.

Tülu 3: Pushing Frontiers in Open Language Model Post-TrainingNathan Lambert*, Jacob Morrison*, Valentina Pyatkin*, Shengyi Huang*, Hamish Ivison*, Faeze Brahman*, Lester James V. Miranda*, Alisa Liu, Nouha Dziri, Shane Lyu, Yuling Gu, Saumya Malik, Victoria Graf, Jena D. Hwang, Jiangjiang Yang, Ronan Le Bras, Oyvind Tafjord, Chris Wilhelm, Luca Soldaini, et al. 2024. COLM.

@article{lambert2024tulu3,
  title = {Tülu 3: Pushing Frontiers in Open Language Model Post-Training},
  author = {Lambert*, Nathan and Morrison*, Jacob and Pyatkin*, Valentina and Huang*, Shengyi and Hamish Ivison* and Brahman*, Faeze and Miranda*, Lester James V. and Liu, Alisa and Dziri, Nouha and Lyu, Shane and Gu, Yuling and Malik, Saumya and Graf, Victoria and Hwang, Jena D. and Yang, Jiangjiang and Bras, Ronan Le and Tafjord, Oyvind and Wilhelm, Chris and Soldaini, Luca and Smith, Noah A. and Wang, Yizhong and Dasigi, Pradeep and Hajishirzi, Hannaneh},
  year = {2024},
  journal = {COLM},
  email = {tulu@allenai.org},
  url = {https://arxiv.org/abs/2411.15124},
  code = {https://github.com/allenai/open-instruct}
}

Personalizing Reinforcement Learning from Human Feedback with Variational Preference LearningSriyash Poddar*, Yanming Wan*, Hamish Ivison, Abhishek Gupta, and Natasha Jaques. 2024. NeurIPS.

@article{Poddar2024PersonalizingRL,
  title = {Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning},
  author = {Poddar*, Sriyash and Wan*, Yanming and Hamish Ivison and Gupta, Abhishek and Jaques, Natasha},
  year = {2024},
  url = {https://arxiv.org/abs/2408.10075},
  code = {https://github.com/WEIRDLabUW/vpl_llm},
  journal = {NeurIPS}
}

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference FeedbackHamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, and Hannaneh Hajishirzi. 2024. NeurIPS.

@article{ivison2024unpacking,
  title = {Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback},
  author = {Hamish Ivison and Wang, Yizhong and Liu, Jiacheng and Wu, Zeqiu and Pyatkin, Valentina and Lambert, Nathan and Smith, Noah A. and Choi, Yejin and Hajishirzi, Hannaneh},
  year = {2024},
  eprint = {2406.09279},
  journal = {NeurIPS},
  url = {https://arxiv.org/abs/2406.09279},
  code = {https://github.com/allenai/open-instruct}
}

OLMo: Accelerating the Science of Language ModelsDirk Groeneveld, Iz Beltagy, ..., Hamish Ivison, ..., Noah A. Smith, and Hannaneh Hajishirzi. 2024. ACL.

Backtracking Mathematical Reasoning of Language Models to the Pretraining DataYasaman Razeghi*, Hamish Ivison*, Sameer Singh, and Yanai Elazar. 2024. The Second Tiny Papers Track at ICLR 2024.

@article{backtracking,
  title = {Backtracking Mathematical Reasoning of Language Models to the Pretraining Data},
  author = {Razeghi*, Yasaman and Hamish Ivison* and Singh, Sameer and Elazar, Yanai},
  booktitle = {The Second Tiny Papers Track at ICLR 2024},
  year = {2024},
  url = {https://openreview.net/pdf?id=otHhLO7GZj}
}

TESS: Text-to-Text Self-Conditioned Simplex DiffusionRabeeh Karimi Mahabadi*, Hamish Ivison*, Jaesung Tae, James Henderson, Iz Beltagy, Matthew E. Peters, and Arman Cohan. 2024. EACL.

@article{tess,
  author = {Mahabadi*, Rabeeh Karimi and Hamish Ivison* and Tae, Jaesung and Henderson, James and Beltagy, Iz and Peters, Matthew E. and Cohan, Arman},
  title = {TESS: Text-to-Text Self-Conditioned Simplex Diffusion},
  journal = {EACL},
  url = {https://arxiv.org/abs/2305.08379},
  year = {2024},
  code = {https://github.com/allenai/tess-diffusion}
}

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2Hamish Ivison*, Yizhong Wang*, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, and Hannaneh Hajishirzi. 2023. technical report.

@article{ivison2023camels,
  title = {Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2},
  author = {Hamish Ivison* and Wang*, Yizhong and Pyatkin, Valentina and Lambert, Nathan and Peters, Matthew and Dasigi, Pradeep and Jang, Joel and Wadden, David and Smith, Noah A. and Beltagy, Iz and Hajishirzi, Hannaneh},
  year = {2023},
  url = {https://arxiv.org/abs/2311.10702},
  eprint = {2311.10702},
  journal = {technical report},
  primaryclass = {cs.CL},
  code = {https://github.com/allenai/open-instruct}
}

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open ResourcesYizhong Wang*, Hamish Ivison*, Pradeep Dasigi, Jack Hessel, Tushar Khot, Khyathi Raghavi Chandu, David Wadden, Kelsey MacMillan, Noah A. Smith, Iz Beltagy, and Hannaneh Hajishirzi. 2023. NeurIPS Datasets and Benchmarks Track.

@article{tulu,
  title = {How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources},
  author = {Wang*, Yizhong and Hamish Ivison* and Dasigi, Pradeep and Hessel, Jack and Khot, Tushar and Chandu, Khyathi Raghavi and Wadden, David and MacMillan, Kelsey and Smith, Noah A. and Beltagy, Iz and Hajishirzi, Hannaneh},
  year = {2023},
  url = {https://arxiv.org/abs/2306.04751},
  eprint = {2306.04751},
  journal = {NeurIPS Datasets and Benchmarks Track},
  primaryclass = {cs.CL},
  code = {https://github.com/allenai/open-instruct}
}

HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot GeneralisationHamish Ivison, Akshita Bhagia, Yizhong Wang, Hannaneh Hajishirzi, and Matthew Peters. 2023. ACL.

@article{hint,
  author = {Hamish Ivison and Bhagia, Akshita and Wang, Yizhong and Hajishirzi, Hannaneh and Peters, Matthew},
  title = {HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation},
  journal = {ACL},
  url = {https://arxiv.org/abs/2212.10315},
  year = {2023},
  code = {https://github.com/allenai/hyper-task-descriptions}
}

Data-Efficient Finetuning Using Cross-Task Nearest NeighborsHamish Ivison, Noah A. Smith, Hannaneh Hajishirzi, and Pradeep Dasigi. 2023. Findings of ACL.

@article{deft,
  author = {Hamish Ivison and Smith, Noah A. and Hajishirzi, Hannaneh and Dasigi, Pradeep},
  title = {Data-Efficient Finetuning Using Cross-Task Nearest Neighbors},
  journal = {Findings of ACL},
  code = {https://github.com/allenai/data-efficient-finetuning},
  url = {https://arxiv.org/abs/2212.00196},
  year = {2023}
}

Hyperdecoders: Instance-specific decoders for multi-task NLPHamish Ivison and Matthew E. Peters. 2022. Findings of EMNLP.

@article{hyperdecoders,
  url = {https://arxiv.org/abs/2203.08304},
  author = {Hamish Ivison and Peters, Matthew E.},
  keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
  title = {Hyperdecoders: Instance-specific decoders for multi-task NLP},
  journal = {Findings of EMNLP},
  year = {2022},
  code = {https://github.com/allenai/hyperdecoders}
}

Local Interpretations for Explainable Natural Language Processing: A SurveySiwen Luo*, Hamish Ivison*, Soyeon Caren Han, and Josiah Poon. 2021. ACM Computing Surveys.

@article{localinterp,
  author = {Luo*, Siwen and Hamish Ivison* and Han, Soyeon Caren and Poon, Josiah},
  title = {Local Interpretations for Explainable Natural Language Processing:
                 {A} Survey},
  year = {2021},
  url = {https://arxiv.org/abs/2103.11072},
  journal = {ACM Computing Surveys},
  eprint = {2103.11072},
  timestamp = {Wed, 24 Mar 2021 15:50:40 +0100}
}

Would you like fries with that? Modular Multi-hop ReasoningHamish Ivison. 2020. November.

@thesis{thesis,
  author = {Hamish Ivison},
  title = {Would you like fries with that? Modular Multi-hop Reasoning},
  school = {University of Sydney},
  type = {Honours Thesis},
  year = {2020},
  month = nov,
  url = {/assets/static/thesis.pdf}
}