feat: new design for automata theory, results on regular languages, and deterministic labelled transition systems #144

fmontesi · 2025-11-07T12:02:01Z

This PR merges the developments that @ctchou and I carried out in the automata branch, #141, and #142, as discussed on Zulip (#CSLib > Question Lean structure and extends). If this is merged, the automata branch and the two mentioned PRs should be erased.

The design based on Acceptor and structure extension works pretty well. Many definitions could be removed because of the Acceptor typeclass, and some proofs got easier thanks to the combination of structure extension, record updates, and grind.

Co-authored with @ctchou.

fmontesi · 2025-11-07T12:15:18Z

bot fix style

chenson2018 · 2025-11-07T12:39:52Z

> bot fix style

Does this work??? Very cool if so. I was surprised to see the bot commenting at all actually, it had been silently failing to post recently... did you do something to fix it?

chenson2018

Just some small style suggestions. The design seems fine to me, but I think you are more in touch with the requirements than I am.

chenson2018 · 2025-11-07T12:42:40Z

Cslib/Computability/Automata/Acceptor.lean

+namespace Cslib.Automata
+
+/-- An `Acceptor` is a machine that recognises strings (lists of symbols in an alphabet). -/
+class Acceptor (α : Type _) (Symbol : outParam (Type _)) where


It is mildly jarring to half follow the Mathlib convention of lowercase Greek for type variables.

I was in doubt myself; it gives me some goosebumps. But I couldn't come up with a better name than alpha. Maybe A for automaton or M for machine?

Cslib/Computability/Automata/DAToNA.lean

Cslib/Computability/Automata/EpsilonNAToNA.lean

Cslib/Computability/Languages/RegularLanguage.lean

chenson2018 · 2025-11-07T13:26:59Z

Cslib/Computability/Languages/RegularLanguage.lean

+  rw [IsRegular.iff_cslib_dfa] at h ⊢
+  obtain ⟨State, _, ⟨da, acc⟩, rfl⟩ := h
+  use State, inferInstance, ⟨da, accᶜ⟩
+  ext ; grind


Suggested change

-  ext ; grind
+  grind

Cslib/Foundations/Semantics/LTS/FLTSToLTS.lean

ctchou · 2025-11-07T17:12:04Z

Can we not use "Finite" in DA.Finite and NA.Finite? This conflicts with the use of "Finite" in Finite T (where T is a type), which also appears in our code. The "Finite" in {DA,NA}.Finite refers to the finiteness of the accepted run, while the "Finite" in Finite T refers to the finiteness of the cardinality of T. Now we end up having expressions like this:

theorem IsRegular.iff_cslib_dfa {l : Language Symbol} : l.IsRegular ↔ ∃ State : Type, ∃ _ : Finite State, ∃ dfa : Cslib.Automata.DA.Finite State Symbol, Cslib.Automata.Acceptor.language dfa = l := by

in which the two "Finite"s mean totally different things. Even more confusing, "finite automata" is well-established terminology in which "finite" means that at least the state space is finite, and perhaps the alphabet as well.

That is why I chose the name FinAcc in my PR. FinRun is another possible name.

See the 1st commit of #145:
99fb99e
which renames {DA,NA}.Finite to {DA,NA}.FinAcc.

ctchou · 2025-11-07T17:17:36Z

I suggest we merge Acceptor.lean and OmegaAcceptor.lean into a single Accept.lean. They are imported together by both DA.lean and NA.lean, which are at the bottom of the import hierarchy anyway. There is no point splitting them.

fmontesi · 2025-11-07T18:09:55Z

Re FinAcc: agreed, I'll switch.

Re Accept: I need to think about it a bit. My thinking was that they're conceptually separate. Maybe we'll use Acceptor somewhere else, or maybe the different substructures of DA and NA will get bigger and merit their own files.

Re all the patches by @chenson2018 : will integrate.

ctchou · 2025-11-07T18:20:33Z

For the Finite -> FinAcc renaming, I've already made a patch in cslib#145 at commit: 99fb99e

fmontesi · 2025-11-07T19:06:20Z

> For the Finite -> FinAcc renaming, I've already made a patch in cslib#145 at commit: 99fb99e

That's great, thanks. I'll merge it as soon as I'm done applying all the other suggestions.

ctchou · 2025-11-07T19:07:57Z

In the commit:
4c1d61f
I added the attributes language and mem_language to {DA,NA}.FinAcc so that we can write {da,na}.language rather than having to explicitly mention Acceptor. For example, RegularLanguage.lean is improved as follows:

```diff
@@ -26,7 +25,7 @@ variable {Symbol : Type*}
 /-- A characterization of Language.IsRegular using Cslib.DA -/
 theorem IsRegular.iff_cslib_dfa {l : Language Symbol} :
   l.IsRegular ↔ ∃ State : Type, ∃ _ : Finite State,
-    ∃ dfa : Cslib.Automata.DA.FinAcc State Symbol, Cslib.Automata.Acceptor.language dfa = l := by
+    ∃ dfa : Cslib.Automata.DA.FinAcc State Symbol, dfa.language = l := by
   constructor
   · rintro ⟨State, h_fin, ⟨tr, start, acc⟩, rfl⟩
     let dfa := Cslib.Automata.DA.FinAcc.mk {tr, start} acc
@@ -40,7 +39,7 @@ theorem IsRegular.iff_cslib_dfa {l : Language Symbol} :
 /-- A characterization of Language.IsRegular using Cslib.NA -/
 theorem IsRegular.iff_cslib_nfa {l : Language Symbol} :
   l.IsRegular ↔ ∃ State : Type, ∃ _ : Finite State,
-    ∃ nfa : Cslib.Automata.NA.FinAcc State Symbol, Cslib.Automata.Acceptor.language nfa = l := by
+    ∃ nfa : Cslib.Automata.NA.FinAcc State Symbol, nfa.language = l := by
   rw [IsRegular.iff_cslib_dfa]; constructor
   · rintro ⟨State, h_fin, ⟨da, acc⟩, rfl⟩
     use State, h_fin, ⟨da.toNA, acc⟩
```

If you like this, I can make similar changes to {DA,NA}.{Buchi,Muller} as well.

fmontesi · 2025-11-07T19:36:07Z

I thought about this and tried to do the same. My first impression was that we get better ergonomics by making them abbrevs.

But I chose to avoid it for now because it adds redundant definitions to every automaton, which kinda kills the point of having a typeclass. There were discussions about supporting dot notation for instances of a class, which would solve our problem. I'd like to take this discussion up with the Lean developers first. (I'll do that now.)

So I'm just gonna cherry-pick 99fb99e for now (the FinAcc patch). We can always revisit this later.

fmontesi · 2025-11-07T20:19:29Z

All done.

I've been thinking and searching more about the dot notation thing from the typeclass: if we really want to use dot notation for language, Accepts, etc., couldn't we make a macro to make all these thin abbreviations?

I'm not sure we should, though. We should ask the mathlibbers about their experience with this.

ctchou · 2025-11-07T22:05:56Z

In my opinion, such "redundant" definitions and theorems are very much in the style of Lean mathlib. For example, take a look at:
https://leanprover-community.github.io/mathlib4_docs/Mathlib/Computability/Language.html
One could argue that all the mem_... theorems there are "redundant", for they are nothing but a slight re-statement of the definitions. In fact, all the ..._defs are actually theorems, not the real definitions, which are mostly instances very much like in our case.

More importantly, don't you find an expression like:
∃ dfa : Cslib.Automata.DA.FinAcc State Symbol, Cslib.Automata.Acceptor.language dfa = l
just intolerable? One advantage of bundling the accepting states in the acceptor is that we don't have to mention and quantify over them explicitly in a theorem like the one above. If we have to write Cslib.Automata.Acceptor.language dfa, what advantage have we got?

ctchou · 2025-11-07T22:21:14Z

Here's an example from mathlib's Language:
```lean
instance : Mul (Language α) :=
  ⟨image2 (· ++ ·)⟩

theorem mul_def (l m : Language α) : l * m = image2 (· ++ ·) l m :=
  rfl

theorem mem_mul : x ∈ l * m ↔ ∃ a ∈ l, ∃ b ∈ m, a ++ b = x :=
  mem_image2
```
The {DA,NA}.FinAcc.{language,mem_language} follow exactly the same pattern.
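For readers following the thread, here is a minimal, self-contained sketch of the pattern under discussion: a typeclass that provides `language`, plus thin per-automaton wrappers so that dot notation works. Everything in the `Sketch` namespace is a simplified, hypothetical stand-in (languages are bare predicates, and `DA` bundles its accepting states directly); it is not the actual CSLib or Mathlib code.

```lean
-- Hypothetical, simplified stand-ins; not the actual CSLib definitions.
namespace Sketch

/-- Toy acceptor class: `α` is the machine type, `Symbol` its alphabet. -/
class Acceptor (α : Type) (Symbol : outParam Type) where
  /-- Whether machine `a` accepts the string `xs`. -/
  Accepts (a : α) (xs : List Symbol) : Prop

/-- The language of a machine, as a predicate on strings. -/
def Acceptor.language {α Symbol : Type} [Acceptor α Symbol] (a : α) :
    List Symbol → Prop :=
  fun xs => Acceptor.Accepts a xs

/-- Toy deterministic automaton with a predicate for accepting states. -/
structure DA (State Symbol : Type) where
  tr : State → Symbol → State
  start : State
  accept : State → Prop

/-- State reached from `s` after reading the string `xs`. -/
def DA.run {State Symbol : Type} (da : DA State Symbol) (s : State)
    (xs : List Symbol) : State :=
  xs.foldl da.tr s

instance {State Symbol : Type} : Acceptor (DA State Symbol) Symbol where
  Accepts da xs := da.accept (da.run da.start xs)

/-- Thin wrapper so that `da.language` resolves via dot notation. -/
abbrev DA.language {State Symbol : Type} (da : DA State Symbol) :
    List Symbol → Prop :=
  Acceptor.language da

/-- Restatement of the definition, in the style of Mathlib's `mem_mul`. -/
theorem DA.mem_language {State Symbol : Type} (da : DA State Symbol)
    (xs : List Symbol) :
    da.language xs ↔ da.accept (da.run da.start xs) :=
  Iff.rfl

end Sketch
```

The wrapper and its `mem_` lemma are the "redundant" declarations being debated: they add no new content, but they let statements say `da.language` instead of naming the class explicitly.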

fmontesi · 2025-11-08T11:05:28Z

Not all of these examples are direct copies of the definitions/theorems in the class, but that's beside the point: as I wrote, my problem is not with having the copied definitions for dot notation (btw, these definitions should always just be a reference to the class definitions/theorems), but more with doing so manually. I would much rather explore a systematic way, e.g., supplying a simple annotation when one creates an instance that produces all the copies needed for dot notation automatically. I'm now discussing this on Zulip.

fmontesi · 2025-11-08T11:32:41Z

Link to the Zulip discussion: #Is there code for X? > Deriving dot notation from class instances

ctchou · 2025-11-08T17:47:26Z

Another response to the problem we are facing in this PR is to ask: Why is having an "acceptor" class preferable to the last design in #142, which doesn't have an "acceptor" class? What exactly is being gained from having an "acceptor" class?

BTW, this brings up another question: Why do you need two acceptor classes? Acceptor and OmegaAcceptor are identical, except for some types. If you want to abstract the notion of "accepting" into an "acceptor" class, why not go all the way and have a single "acceptor" class and two different instantiations of it?

fmontesi · 2025-11-08T18:08:00Z

Code deduplication.
Right you are, I could define

/-- An `Acceptor` is a machine that recognises strings (lists of symbols in an alphabet). -/
class Acceptor (α : Type _) (β : outParam (Type _)) where
  /-- Predicate that establishes whether a string `xs` is accepted. -/
  Accepts (a : α) (b : β) : Prop

Then we'd have things like:

instance : Acceptor (Buchi State Symbol) (ωSequence Symbol) where
  Accepts (a : Buchi State Symbol) (xs : ωSequence Symbol) := ∃ᶠ k in atTop, a.run xs k ∈ a.accept

which looks totally fine to me. I'll get to it, that's even more deduplication. :-)

ctchou · 2025-11-08T18:26:50Z

Do we really get code deduplication? If I still have to explicitly make instances of language and mem_language in the style of the Mul example from Language, what code deduplication do I get? I asked the question on Zulip too:
https://leanprover.zulipchat.com/#narrow/channel/217875-Is-there-code-for-X.3F/topic/Deriving.20dot.20notation.20from.20class.20instances/near/554501460

ctchou · 2025-11-08T18:37:58Z

Note that the Mul example from Language actually gets code deduplication, because there are many theorems already proved about image2 which can now be instantiated to produce theorems about Mul. (This actually happens in Language.) But do we have any analogues here? Are there nontrivial abstract theorems about Acceptor that can be proved and re-used by its instances? This seems to me rather unlikely, because Acceptor contains only a predicate, a set defined by the predicate, and a theorem relating the predicate and the set. What sophisticated theorems can be proved about them that are not already in mathlib?

fmontesi · 2025-11-09T11:45:42Z

> Code deduplication.
> Right you are, I could define
>
> /-- An `Acceptor` is a machine that recognises strings (lists of symbols in an alphabet). -/
> class Acceptor (α : Type _) (β : outParam (Type _)) where
>   /-- Predicate that establishes whether a string `xs` is accepted. -/
>   Accepts (a : α) (b : β) : Prop
>
> Then we'd have things like:
>
> instance : Acceptor (Buchi State Symbol) (ωSequence Symbol) where
>   Accepts (a : Buchi State Symbol) (xs : ωSequence Symbol) := ∃ᶠ k in atTop, a.run xs k ∈ a.accept
>
> which looks totally fine to me. I'll get to it, that's even more deduplication. :-)

Hmm, actually this doesn't work quite as expected, as Acceptor.language returns a Language whereas OmegaAcceptor.language returns an OmegaLanguage... I'll keep them separate for the time being.
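To make the trade-off concrete, here is a rough sketch of what a single class with two instantiations could look like. It is an illustration under loud assumptions: all names in `UnifiedSketch` are hypothetical, infinite words are modelled as plain functions `Nat → Symbol`, and the language is a bare predicate `β → Prop`. That last choice is exactly what hides the obstacle mentioned above: once `language` has to return the library's dedicated language types for finite and infinite words, one generic definition no longer has a single return type.

```lean
-- Hypothetical sketch; not the CSLib design.
namespace UnifiedSketch

/-- Unified class: `β` is the input type (finite or infinite words). -/
class Acceptor (α : Type) (β : outParam Type) where
  /-- Whether machine `a` accepts input `b`. -/
  Accepts (a : α) (b : β) : Prop

/-- Generic language as a bare predicate on inputs. -/
def Acceptor.language {α β : Type} [Acceptor α β] (a : α) : β → Prop :=
  fun b => Acceptor.Accepts a b

/-- Toy deterministic automaton: transition function and start state. -/
structure DA (State Symbol : Type) where
  tr : State → Symbol → State
  start : State

/-- State reached after reading the first `k` symbols of an infinite word. -/
def DA.runAt {State Symbol : Type} (da : DA State Symbol)
    (xs : Nat → Symbol) : Nat → State
  | 0 => da.start
  | k + 1 => da.tr (DA.runAt da xs k) (xs k)

/-- Finite-word acceptance: the run ends in an accepting state. -/
structure FinAcc (State Symbol : Type) extends DA State Symbol where
  accept : State → Prop

/-- Büchi acceptance: accepting states are visited infinitely often. -/
structure Buchi (State Symbol : Type) extends DA State Symbol where
  accept : State → Prop

instance {State Symbol : Type} :
    Acceptor (FinAcc State Symbol) (List Symbol) where
  Accepts a xs := a.accept (xs.foldl a.tr a.start)

instance {State Symbol : Type} :
    Acceptor (Buchi State Symbol) (Nat → Symbol) where
  -- "Infinitely often", spelled out without filters.
  Accepts a xs := ∀ n : Nat, ∃ k : Nat, n ≤ k ∧ a.accept (DA.runAt a.toDA xs k)

end UnifiedSketch
```

In this toy setting both instances share one `Acceptor.language`; whether that carries over depends on how the finite and infinite language types are defined in the actual library.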

ctchou · 2025-11-09T22:13:35Z

Zulip thread: https://leanprover.zulipchat.com/#narrow/channel/513188-CSLib/topic/deriving.20dot.20notation/with/554560824

fmontesi and others added 14 commits November 1, 2025 14:36

feat: restructuring of automata theory (WIP)

cc87d82

fix: DFA.acceptor

77a7605

feat: add Cslib/Computability/Languages/RegularLanguage.lean and asso…

930352f

…ciated changes

Fix linter failures

edb6c9d

Incorporate Chris Henson's and Eric Wieser's comments

71e4e21

Unbundle typeclasses from DFA

39181f4

feat: unbundled design for automata theory

add8af2

feat: FLTS

bbffffe

Incorporates Chris Henson's comments

00d29c0

More work on automata

080befc

add omega acceptor

90a6413

Incorporate Fabrizio Montesi's comments

0185af7

feat (Automata): merge work on Acceptor and the different kind of ome…

6c23353

…ga-automata

fix (Automata): reinstate EpsilonNA

87ccb6e

fmontesi requested a review from chenson2018 as a code owner November 7, 2025 12:02

fmontesi added the automata label Nov 7, 2025

Update Cslib/Computability/Languages/RegularLanguage.lean

b4ff134

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

fix: add files to Cslib.lean

7ead0e5

chenson2018 requested changes Nov 7, 2025

ctchou mentioned this pull request Nov 7, 2025

chore: modifications of cslib#144 #145

Closed

fmontesi and others added 4 commits November 7, 2025 19:56

Update Cslib/Computability/Languages/RegularLanguage.lean

47dcb3b

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update Cslib/Computability/Languages/RegularLanguage.lean

7900e9f

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update Cslib/Computability/Languages/RegularLanguage.lean

dd12503

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update Cslib/Computability/Languages/RegularLanguage.lean

ce18771

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

fmontesi and others added 6 commits November 7, 2025 20:01

Update Cslib/Computability/Automata/DAToNA.lean

04a923d

Co-authored-by: Chris Henson <[email protected]>

Update Cslib/Computability/Automata/EpsilonNAToNA.lean

e8af0c6

Co-authored-by: Chris Henson <[email protected]>

Update Cslib/Computability/Automata/EpsilonNAToNA.lean

c4f52c0

Co-authored-by: Chris Henson <[email protected]>

chore: open scoped before theorem in EpsilonNAToNA

e34ee88

Update Cslib/Computability/Languages/RegularLanguage.lean

a2e9f48

Co-authored-by: Chris Henson <[email protected]>

Update Cslib/Foundations/Semantics/LTS/FLTSToLTS.lean

b6e90c0

Co-authored-by: Chris Henson <[email protected]>

Globally rename {DA,NA}.Finite to {DA,NA}.FinAcc

6f0e68f

fmontesi requested a review from chenson2018 November 7, 2025 20:17

chore: make theorem statements more readable by opening Acceptor and …

45a57cd

…Cslib in some automata files

some namespacing and better grind annotations for automata equivalences

df16eea

chenson2018 approved these changes Nov 9, 2025

fmontesi merged commit 61296f1 into main Nov 9, 2025
4 checks passed

This was referenced Nov 9, 2025

feat: add Cslib/Computability/Languages/RegularLanguage.lean and associated changes #141

Closed

feat: unbundled design for automata theory #142

Closed

feat: An initial version of Buchi and Muller automata #85

Closed

fmontesi deleted the automata branch November 17, 2025 10:30
