In principle, an LLM like ChatGPT could be used to validate proofs. But for proof checking,
there's a huge difference between a system that works most of the time and one that is
reliable all of the time (and perhaps even provably reliable)!
LLMs are designed to give answers that sound plausible, and “Yes, that sounds right
to me” is a depressingly plausible answer to a long, complicated, and incorrect
proof.
More generally, even if you asked ChatGPT to translate the English-language proof into a
formal system like Coq or Lean and then had a computer verify the result, there's always
the question of hallucination: does the translation always preserve the meaning of the
input proof, or might the response to an incorrect input proof be a more plausible (i.e.,
corrected) Coq or Lean proof, or a correct proof for a different theorem?
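To make the contrast concrete, here is a minimal (hypothetical) illustration in Lean of why machine verification has no "sounds plausible" middle ground: the kernel either accepts a proof of the stated theorem or rejects it.

```lean
-- A correct statement with a correct proof: Lean's kernel accepts it.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b

-- If a faulty translation changed the statement, e.g.
--   theorem bad (a b : Nat) : a + b = b + b := Nat.add_comm a b
-- the proof would no longer type-check and Lean would report an error.
```

The danger described above is precisely that a hallucinating translator could produce the first, valid proof even when the English-language input argued for something like the second, false statement; the checker would then happily verify the wrong theorem.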
In contrast, a theory-driven system for checking natural language proofs is likely to be
less flexible, but also much more predictable, with explainable output.