Finite Automata

This chapter introduces Finite Automata, a fundamental model for computation central to formal language theory. It systematically explores Deterministic and Non-deterministic Finite Automata, culminating in the proof of their equivalence. Mastery of these concepts is essential for understanding language recognition and is a prerequisite for success in CMI examinations.

Chapter Contents

| Topic |

|---|-------| | 1 | Deterministic Finite Automata (DFA) | | 2 | Non-deterministic Finite Automata (NFA) | | 3 | Equivalence of Automata |

We begin with Deterministic Finite Automata (DFA).

Part 1: Deterministic Finite Automata (DFA)

Deterministic Finite Automata are fundamental computational models used to recognize regular languages. Understanding their construction and properties is crucial for CMI, as they form the basis for analyzing and designing pattern recognition systems.

---

Core Concepts

1. Formal Definition of a DFA

A Deterministic Finite Automaton (DFA) is formally defined as a 5-tuple $M = (Q, \Sigma, \delta, q_0, F)$ .

📖 DFA Components

$Q$ : A finite, non-empty set of states.
$\Sigma$ : A finite, non-empty alphabet of input symbols.
$\delta$ : A total transition function $Q \times \Sigma \to Q$ . For each state and input symbol, there is exactly one next state.
$q_0$ : The initial (start) state, $q_0 \in Q$ .
$F$ : A set of final (accepting) states, $F \subseteq Q$ .

We define the extended transition function $\delta^$ recursively.
For any state $q \in Q$ , string $w \in \Sigma^*$ , and symbol $a \in \Sigma$ :

📐 Extended Transition Function

\begin{aligned} \delta^

Where:

\varepsilon

is the empty string. When to use: To determine the final state after processing an entire string.

The language accepted by a DFA $M$ , denoted $L(M)$ , is the set of all strings $w \in \Sigma^$ such that $\delta^$ (q_0, w) \in F $δ^{*} (q_{0}, w) \in F$ .

Worked Example: Trace a string on a given DFA.

Consider a DFA $M = (\{q_0, q_1, q_2\}, \{0, 1\}, \delta, q_0, \{q_2\})$ with transitions:
$\delta(q_0, 0) = q_0$ , $\delta(q_0, 1) = q_1$
$\delta(q_1, 0) = q_2$ , $\delta(q_1, 1) = q_1$
$\delta(q_2, 0) = q_2$ , $\delta(q_2, 1) = q_2$
Determine if the string $w = 101$ is accepted.

Step 1: Initialize with the start state and empty string.

\delta^*(q_0, \varepsilon) = q_0

Step 2: Process the first symbol '1'.

\delta^

Step 3: Process the second symbol '0'.

\delta^

Step 4: Process the third symbol '1'.

\delta^

Answer: Since $\delta^*(q_0, 101) = q_2$ and $q_2 \in F$ , the string $101$ is accepted by $M$ .

:::question type="MCQ" question="Consider a DFA $M = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_1\})$ with transitions: $\delta(q_0, a) = q_1$ , $\delta(q_0, b) = q_0$ ; $\delta(q_1, a) = q_1$ , $\delta(q_1, b) = q_2$ ; $\delta(q_2, a) = q_2$ , $\delta(q_2, b) = q_2$ . Which of the following strings is accepted by $M$ ?" options=[" $abb$ "," $ba$ "," $aab$ "," $\varepsilon$ "] answer=" $aab$ " hint="Trace each string through the DFA using the extended transition function and check if the final state is an accepting state." solution="Step 1: Trace $abb$ :
>

\delta^*(q_0, a) = q_1

\delta^*(q_0, ab) = \delta(q_1, b) = q_2

\delta^*(q_0, abb) = \delta(q_2, b) = q_2

Since

q_2 \notin F

abb

is not accepted.

Step 2: Trace $ba$ :
>

\delta^*(q_0, b) = q_0

\delta^*(q_0, ba) = \delta(q_0, a) = q_1

Since

q_1 \in F

ba

is accepted.

Step 3: Trace $aab$ :
>

\delta^*(q_0, a) = q_1

\delta^*(q_0, aa) = \delta(q_1, a) = q_1

\delta^*(q_0, aab) = \delta(q_1, b) = q_2

Since

q_2 \notin F

aab

is not accepted.

Step 4: Trace $\varepsilon$ :
>

\delta^*(q_0, \varepsilon) = q_0

Since

q_0 \notin F

\varepsilon

is not accepted.

Therefore, only $ba$ is accepted. The options given in the question are not matching the correct answer from the provided options. Let's re-evaluate the question and options provided by the user. The options provided were for a different question. I will generate a question with options based on my solution.

Let's use the provided solution to create a correct MCQ.
Original question: Which of the following strings are accepted by it? Options: ["00111","00110110","011010"," $\varepsilon$ "]
The provided solution for PYQ 5 states $q_0, q_3, q_5$ are final states.
Let's re-trace the example to match a common scenario.

Consider a DFA $M = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_1\})$ with transitions:
$\delta(q_0, a) = q_1$ , $\delta(q_0, b) = q_0$
$\delta(q_1, a) = q_1$ , $\delta(q_1, b) = q_2$
$\delta(q_2, a) = q_2$ , $\delta(q_2, b) = q_2$

String $a$ : $\delta^*(q_0, a) = q_1$ . $q_1 \in F$ . Accepted.
String $b$ : $\delta^*(q_0, b) = q_0$ . $q_0 \notin F$ . Not accepted.
String $aa$ : $\delta^*(q_0, aa) = \delta(q_1, a) = q_1$ . $q_1 \in F$ . Accepted.
String $ab$ : $\delta^*(q_0, ab) = \delta(q_1, b) = q_2$ . $q_2 \notin F$ . Not accepted.
String $aab$ : $\delta^*(q_0, aab) = \delta(q_1, b) = q_2$ . $q_2 \notin F$ . Not accepted.
String $ba$ : $\delta^*(q_0, ba) = \delta(q_0, a) = q_1$ . $q_1 \in F$ . Accepted.

The question options need to be carefully chosen.
Let's choose options that make sense with the example DFA.

Corrected Question:
Consider a DFA $M = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_1\})$ with transitions: $\delta(q_0, a) = q_1$ , $\delta(q_0, b) = q_0$ ; $\delta(q_1, a) = q_1$ , $\delta(q_1, b) = q_2$ ; $\delta(q_2, a) = q_2$ , $\delta(q_2, b) = q_2$ . Which of the following strings is accepted by $M$ ?
Options: [" $abb$ "," $ba$ "," $aab$ "," $\varepsilon$ "]
Answer: " $ba$ "
Solution:
Step 1: Trace $abb$ :
>

\delta^*(q_0, a) = q_1

\delta^*(q_0, ab) = \delta(q_1, b) = q_2

\delta^*(q_0, abb) = \delta(q_2, b) = q_2

Since

q_2 \notin F

abb

is not accepted.

Step 2: Trace $ba$ :
>

\delta^*(q_0, b) = q_0

\delta^*(q_0, ba) = \delta(q_0, a) = q_1

Since

q_1 \in F

ba

is accepted.

Step 3: Trace $aab$ :
>

\delta^*(q_0, a) = q_1

\delta^*(q_0, aa) = \delta(q_1, a) = q_1

\delta^*(q_0, aab) = \delta(q_1, b) = q_2

Since

q_2 \notin F

aab

is not accepted.

Step 4: Trace $\varepsilon$ :
>

\delta^*(q_0, \varepsilon) = q_0

Since

q_0 \notin F

\varepsilon

is not accepted.

The correct accepted string is $ba$ .
"
:::

---

2. DFA Construction: Basic Pattern Recognition

We construct DFAs by designing states to remember relevant information about the input string seen so far. Common patterns include prefixes, suffixes, and substrings.

Worked Example: Construct a DFA for the language $L = \{w \in \{a,b\}^* \mid w \text{ contains } ab \text{ as a substring}\}$ .

Step 1: Define states based on the longest suffix of the input that is also a prefix of the target pattern "ab".

$q_0$ : Initial state, no part of "ab" seen, or previous character does not help. (Corresponds to $\varepsilon$ )

$q_1$ : 'a' seen, but not 'ab'. (Corresponds to 'a')

$q_2$ : 'ab' seen. (Accepting state)

Step 2: Define the alphabet

\Sigma = \{a, b\}

Step 3: Define transitions $\delta$ :

From $q_0$ :

- On 'a': We've seen 'a', potentially the start of 'ab'. Go to

q_1

.
- On 'b': We've seen 'b', which doesn't help form 'ab' from

q_0

. Stay in

q_0

From $q_1$ : (We've seen 'a')

- On 'a': We've seen another 'a'. The longest suffix of the string that is a prefix of 'ab' is still 'a'. Stay in

q_1

.
- On 'b': We've seen 'ab'. Go to

q_2

From $q_2$ : (We've seen 'ab')

- On 'a' or 'b': Once 'ab' has been seen, any subsequent input keeps it accepted. Stay in

q_2

Step 4: Identify initial and final states.

$q_0$ is the initial state.

$F = \{q_2\}$ is the set of final states.

Answer: The DFA is

M = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_2\})

with transitions:

\delta(q_0, a) = q_1, \delta(q_0, b) = q_0

\delta(q_1, a) = q_1, \delta(q_1, b) = q_2

\delta(q_2, a) = q_2, \delta(q_2, b) = q_2

:::question type="MCQ" question="Construct a DFA over $\Sigma = \{0, 1\}$ that accepts all strings that end with $01$ ." options=["DFA with states tracking the last two symbols.","DFA with a single accepting state reached only after $01$ .","DFA that tracks if $01$ has been seen anywhere.","DFA with states corresponding to length modulo 2."] answer="DFA with states tracking the last two symbols." hint="To recognize a suffix, the DFA must remember the most recent symbols. Design states to represent the last seen symbol, or lack thereof, and whether the suffix matches the target." solution="Step 1: Define states. We need to remember if we just saw a $0$ , or if we just saw a $1$ (after a $0$ ), or nothing useful.

$q_0$ : Initial state, no relevant suffix seen.

$q_1$ : Last symbol seen was $0$ .

$q_2$ : Last two symbols seen were $01$ . (Accepting state)

Step 2: Define transitions.

From $q_0$ :

- On

0

: Go to

q_1

.
- On

1

: Stay in

q_0

(last symbol is

1

, not

01

From $q_1$ : (Last symbol was $0$ )

- On

0

: Stay in

q_1

(last symbol is still

0

).
- On

1

: Go to

q_2

(last two symbols are

01

From $q_2$ : (Last two symbols were $01$ )

- On

0

: Go to

q_1

(last symbol is

0

).
- On

1

: Stay in

q_0

(last symbol is

1

, not

01

Step 3: Final states. $F = \{q_2\}$ .

This DFA correctly identifies strings ending with $01$ . The option 'DFA with states tracking the last two symbols' best describes this approach."
:::

Worked Example: Construct a DFA over $\Sigma = \{a,b,c\}$ for $L_{\mathrm{even}}$ , the set of all even length strings. (PYQ 8 adapted)

Step 1: Define states based on the parity of the string length.

$q_0$ : Length is even.

$q_1$ : Length is odd.

Step 2: Define the alphabet

\Sigma = \{a,b,c\}

Step 3: Define transitions $\delta$ :

From $q_0$ : (Current length is even)

- On any symbol (

a, b, c

): Length becomes odd. Go to

q_1

From $q_1$ : (Current length is odd)

- On any symbol (

a, b, c

): Length becomes even. Go to

q_0

Step 4: Identify initial and final states.

$q_0$ is the initial state (empty string has length 0, which is even).

$F = \{q_0\}$ is the set of final states.

Answer: The DFA is

M = (\{q_0, q_1\}, \{a, b, c\}, \delta, q_0, \{q_0\})

with transitions:

\delta(q_0, a) = q_1, \delta(q_0, b) = q_1, \delta(q_0, c) = q_1

\delta(q_1, a) = q_0, \delta(q_1, b) = q_0, \delta(q_1, c) = q_0

:::question type="NAT" question="What is the minimum number of states required for a DFA over $\Sigma=\{0,1\}$ that accepts strings containing an odd number of $1$ s?" answer="2" hint="Consider how many distinct 'counts' of 1s you need to track. Parity is sufficient." solution="Step 1: Identify the information to track. We need to know if the count of $1$ s seen so far is odd or even.

Step 2: Define states.

$q_{\text{even}}$ : The string seen so far has an even number of $1$ s.

$q_{\text{odd}}$ : The string seen so far has an odd number of $1$ s.

Step 3: Define transitions.

From $q_{\text{even}}$ :

- On

0

: Parity of

1

s remains even. Stay in

q_{\text{even}}

.
- On

1

: Parity of

1

s becomes odd. Go to

q_{\text{odd}}

From $q_{\text{odd}}$ :

- On

0

: Parity of

1

s remains odd. Stay in

q_{\text{odd}}

.
- On

1

: Parity of

1

s becomes even. Go to

q_{\text{even}}

Step 4: Initial and final states.

Initial state: $q_{\text{even}}$ (empty string has 0 $1$ s, which is even).

Final state: $q_{\text{odd}}$ (we want an odd number of $1$ s).

This DFA requires 2 states. It is the minimum possible because the language requires distinguishing between strings with an odd number of

1

s and strings with an even number of

1

s. For example,

\varepsilon

(even) and

1

(odd) must lead to different equivalence classes, implying at least two states."
:::

---

3. DFA Construction: Divisibility Problems

DFAs can be effectively used to recognize languages where strings, interpreted as numbers in some base, are divisible by a fixed integer. This typically involves using states to represent remainders modulo the divisor.

Worked Example: Design a DFA that accepts strings $x \in \{0,1,2\}^*$ such that $\operatorname{val}(x)$ is divisible by $4$ , where $\operatorname{val}(x)$ is the ternary value of $x$ with the leftmost digit being the most significant. (PYQ 10 adapted)

Step 1: Define states. We need to track the remainder of the ternary value modulo $4$ .

$q_0, q_1, q_2, q_3$ : State $q_i$ means the ternary value read so far is congruent to $i \pmod 4$ .

Step 2: Define the alphabet

\Sigma = \{0,1,2\}

Step 3: Define transitions $\delta$ . If the current remainder is $i$ and we read digit $b$ , the new value is $3 \cdot (\text{old value}) + b$ . So, the new remainder is $(3i + b) \pmod 4$ .

From $q_0$ :

- On

0

(3 \cdot 0 + 0) \pmod 4 = 0 \implies q_0

- On

1

(3 \cdot 0 + 1) \pmod 4 = 1 \implies q_1

- On

2

(3 \cdot 0 + 2) \pmod 4 = 2 \implies q_2

From $q_1$ :

- On

0

(3 \cdot 1 + 0) \pmod 4 = 3 \implies q_3

- On

1

(3 \cdot 1 + 1) \pmod 4 = 0 \implies q_0

- On

2

(3 \cdot 1 + 2) \pmod 4 = 1 \implies q_1

From $q_2$ :

- On

0

(3 \cdot 2 + 0) \pmod 4 = 2 \implies q_2

- On

1

(3 \cdot 2 + 1) \pmod 4 = 3 \implies q_3

- On

2

(3 \cdot 2 + 2) \pmod 4 = 0 \implies q_0

From $q_3$ :

- On

0

(3 \cdot 3 + 0) \pmod 4 = 1 \implies q_1

- On

1

(3 \cdot 3 + 1) \pmod 4 = 2 \implies q_2

- On

2

(3 \cdot 3 + 2) \pmod 4 = 3 \implies q_3

Step 4: Identify initial and final states.

Initial state: $q_0$ (empty string has value 0, which is $0 \pmod 4$ ).

Final state: $q_0$ (for divisibility by $4$ ).

Answer: The DFA is

M = (\{q_0, q_1, q_2, q_3\}, \{0, 1, 2\}, \delta, q_0, \{q_0\})

with the transitions defined above.

:::question type="NAT" question="What is the minimum number of states required for a DFA over $\Sigma=\{0,1\}$ that accepts binary strings whose decimal value is divisible by $5$ ? (Assume the most significant digit is on the left)." answer="5" hint="For divisibility by $N$ , you typically need $N$ states to track all possible remainders modulo $N$ . The transitions are based on $(2 \cdot \text{current_remainder} + \text{new_digit}) \pmod N$ ." solution="Step 1: Identify the information to track. We need to track the remainder of the binary value modulo $5$ .

Step 2: Define states. We need $5$ states, $q_0, q_1, q_2, q_3, q_4$ , where $q_i$ represents that the binary value processed so far has a remainder of $i$ when divided by $5$ .

Step 3: Define transitions. If the current remainder is $i$ and we read a new digit $b \in \{0,1\}$ , the new value is $2 \cdot (\text{old value}) + b$ . So the new remainder is $(2i + b) \pmod 5$ .

From $q_0$ :

- On

0

(2 \cdot 0 + 0) \pmod 5 = 0 \implies q_0

- On

1

(2 \cdot 0 + 1) \pmod 5 = 1 \implies q_1

From $q_1$ :

- On

0

(2 \cdot 1 + 0) \pmod 5 = 2 \implies q_2

- On

1

(2 \cdot 1 + 1) \pmod 5 = 3 \implies q_3

From $q_2$ :

- On

0

(2 \cdot 2 + 0) \pmod 5 = 4 \implies q_4

- On

1

(2 \cdot 2 + 1) \pmod 5 = 0 \implies q_0

From $q_3$ :

- On

0

(2 \cdot 3 + 0) \pmod 5 = 1 \implies q_1

- On

1

(2 \cdot 3 + 1) \pmod 5 = 2 \implies q_2

From $q_4$ :

- On

0

(2 \cdot 4 + 0) \pmod 5 = 3 \implies q_3

- On

1

(2 \cdot 4 + 1) \pmod 5 = 4 \implies q_4

Step 4: Initial and final states.

Initial state: $q_0$ (empty string has value 0, which is $0 \pmod 5$ ).

Final state: $q_0$ (for divisibility by $5$ ).

The minimum number of states required is

5

because each remainder modulo

5

must be distinguishable, and they all appear in the state transitions."
:::

---

4. DFA Construction: Complex Pattern Tracking

Some languages require tracking multiple properties or more intricate relationships between symbols. This often leads to more states, where each state encodes a unique combination of relevant information.

Worked Example: Construct a DFA for the language $L$ consisting of all binary strings with an equal number of occurrences of $01$ and $10$ as substrings. (PYQ 3 & 4 adapted)

Step 1: Analyze the property. The number of $01$ occurrences equals the number of $10$ occurrences if and only if the first symbol of the string is the same as the last symbol, or the string is empty.
This holds because each $01$ transitions from a $0$ -block to a $1$ -block, and each $10$ transitions from a $1$ -block to a $0$ -block. For the counts to be equal, the 'net change' in block type must be zero, meaning the string starts and ends with the same block type.

Step 2: Define states to track the first symbol and the current last symbol.

$q_\varepsilon$ : Empty string (accepting).

$q_{00}$ : String started with $0$ , currently ends with $0$ .

$q_{01}$ : String started with $0$ , currently ends with $1$ .

$q_{11}$ : String started with $1$ , currently ends with $1$ .

$q_{10}$ : String started with $1$ , currently ends with $0$ .

Step 3: Define the alphabet

\Sigma = \{0, 1\}

Step 4: Define transitions $\delta$ :

From $q_\varepsilon$ :

- On

0

: String starts and ends with

0

. Go to

q_{00}

.
- On

1

: String starts and ends with

1

. Go to

q_{11}

From $q_{ij}$ (where $i$ is first symbol, $j$ is current last symbol):

- On

b \in \{0, 1\}

: The first symbol

i

remains, the new last symbol is

b

. Go to

q_{ib}

.
- Example:

\delta(q_{00}, 0) = q_{00}

\delta(q_{00}, 1) = q_{01}

- Example:

\delta(q_{01}, 0) = q_{00}

\delta(q_{01}, 1) = q_{01}

- Example:

\delta(q_{11}, 0) = q_{10}

\delta(q_{11}, 1) = q_{11}

- Example:

\delta(q_{10}, 0) = q_{10}

\delta(q_{10}, 1) = q_{11}

Step 5: Identify initial and final states.

Initial state: $q_\varepsilon$ .

Final states: $q_\varepsilon, q_{00}, q_{11}$ (strings where first and last symbols are equal, or empty string).

Answer: The DFA is

M = (\{q_\varepsilon, q_{00}, q_{01}, q_{11}, q_{10}\}, \{0, 1\}, \delta, q_\varepsilon, \{q_\varepsilon, q_{00}, q_{11}\})

with the transitions defined above.

:::question type="MCQ" question="Construct a DFA over $\Sigma = \{a, b\}$ for the language of all strings where the number of $a$ 's is congruent to the number of $b$ 's modulo $3$ . Which of the following state definitions correctly captures the necessary information?" options=["States $(c_a, c_b)$ where $c_a$ is count of $a$ 's and $c_b$ is count of $b$ 's.","States $(r_a, r_b)$ where $r_a = (\#a) \pmod 3$ and $r_b = (\#b) \pmod 3$ .","States $(r_{a-b})$ where $r_{a-b} = (\#a - \#b) \pmod 3$ .","States $(q_0, q_1, q_2)$ where $q_i$ means $(\#a + \#b) \pmod 3 = i$ ." ] answer="States $(r_{a-b})$ where $r_{a-b} = (\#a - \#b) \pmod 3$ ." hint="The condition is $\#a \equiv \#b \pmod 3$ , which is equivalent to $\#a - \#b \equiv 0 \pmod 3$ . Therefore, we only need to track the difference modulo $3$ ." solution="Step 1: The condition is $\#a \equiv \#b \pmod 3$ . This can be rewritten as $\#a - \#b \equiv 0 \pmod 3$ .
We need to track the value of $(\#a - \#b) \pmod 3$ .

Step 2: Define states. Let $Q = \{q_0, q_1, q_2\}$ , where $q_i$ represents that $(\#a - \#b) \pmod 3 = i$ .

Step 3: Define transitions.

From $q_i$ on $a$ : The count of $a$ 's increases by $1$ . So, the new difference is $(i+1) \pmod 3$ .

From $q_i$ on $b$ : The count of $b$ 's increases by $1$ . So, the new difference is $(i-1) \pmod 3$ , which is $(i+2) \pmod 3$ .

Step 4: Initial and final states.

Initial state: $q_0$ (for $\varepsilon$ , $\#a=0, \#b=0 \implies 0-0=0 \pmod 3$ ).

Final state: $q_0$ (for $(\#a - \#b) \pmod 3 = 0$ ).

This DFA requires

3

states. The most efficient way to track this property is by tracking the difference modulo

3

, not separate counts modulo

3

for

a

and

b

, nor the sum."
:::

---

5. DFA State Minimization and Language Complexity

For any regular language, there is a unique (up to isomorphism) DFA with the minimum number of states. This minimum DFA is important for understanding the inherent complexity of a regular language. Myhill-Nerode theorem formally states that the number of states in the minimal DFA for a language $L$ is equal to the number of equivalence classes of the Myhill-Nerode relation for $L$ .

📖 Distinguishable States

Two states $p, q \in Q$ are distinguishable if there exists a string $w \in \Sigma^$ such that $\delta^$ (p, w) \in F $δ^{*} (p, w) \in F$ and $\delta^*(q, w) \notin F$ , or vice versa. Otherwise, they are indistinguishable or equivalent.

Worked Example: Minimize the following DFA $M = (\{q_0, q_1, q_2, q_3, q_4\}, \{0, 1\}, \delta, q_0, \{q_2, q_4\})$ .
Transitions:
$\delta(q_0, 0) = q_1, \delta(q_0, 1) = q_2$
$\delta(q_1, 0) = q_0, \delta(q_1, 1) = q_3$
$\delta(q_2, 0) = q_4, \delta(q_2, 1) = q_1$
$\delta(q_3, 0) = q_2, \delta(q_3, 1) = q_4$
$\delta(q_4, 0) = q_3, \delta(q_4, 1) = q_0$

Step 1: Partition states into $P_0$ based on final/non-final.
$P_0 = \{\{q_0, q_1, q_3\}, \{q_2, q_4\}\}$ (Non-final states, Final states)

Step 2: Refine partitions $P_k$ to $P_{k+1}$ by checking distinguishability for each group.
For a group $A \in P_k$ , states $p, q \in A$ are distinguishable if for some $a \in \Sigma$ , $\delta(p, a)$ and $\delta(q, a)$ are in different groups of $P_k$ .

Iteration 1: From $P_0$ to $P_1$

Group $\{q_0, q_1, q_3\}$ :

- Check

q_0, q_1

:
-

\delta(q_0, 0) = q_1 \in \{q_0, q_1, q_3\}

\delta(q_1, 0) = q_0 \in \{q_0, q_1, q_3\}

\delta(q_0, 1) = q_2 \in \{q_2, q_4\}

\delta(q_1, 1) = q_3 \in \{q_0, q_1, q_3\}

q_0

and

q_1

are distinguishable by input '1' because

q_2 \in F

and

q_3 \notin F

. This is incorrect. Both

q_2

and

q_3

are in different groups in

P_0

.
-

\delta(q_0, 1) = q_2 \in \{q_2, q_4\}

(Group F)
-

\delta(q_1, 1) = q_3 \in \{q_0, q_1, q_3\}

(Group NF)
- Since

q_2

and

q_3

are in different groups of

P_0

q_0

and

q_1

are distinguishable.
- Check

q_0, q_3

:
-

\delta(q_0, 0) = q_1 \in \{q_0, q_1, q_3\}

\delta(q_3, 0) = q_2 \in \{q_2, q_4\}

- Since

q_1

and

q_2

are in different groups of

P_0

q_0

and

q_3

are distinguishable.
- Check

q_1, q_3

:
-

\delta(q_1, 0) = q_0 \in \{q_0, q_1, q_3\}

\delta(q_3, 0) = q_2 \in \{q_2, q_4\}

- Since

q_0

and

q_2

are in different groups of

P_0

q_1

and

q_3

are distinguishable.
- All states in

\{q_0, q_1, q_3\}

are distinguishable from each other. They form singleton sets.

Group $\{q_2, q_4\}$ :

\delta(q_2, 0) = q_4 \in \{q_2, q_4\}

\delta(q_4, 0) = q_3 \in \{q_0, q_1, q_3\}

- Since

q_4

and

q_3

are in different groups of

P_0

q_2

and

q_4

are distinguishable.

P_1 = \{\{q_0\}, \{q_1\}, \{q_3\}, \{q_2\}, \{q_4\}\}

. No two states are equivalent. This DFA is already minimal.

Answer: The given DFA is already minimal. If there were equivalent states, we would merge them and update transitions.

⚠️ Common Mistake in Minimization

❌ Mistake: Only checking if states lead to final/non-final states for distinguishability.
✅ Correct approach: Two states $p, q$ are distinguishable if for some input $a$ , $\delta(p, a)$ and $\delta(q, a)$ fall into different groups of the current partition. The groups themselves can contain both final and non-final states in intermediate steps.

:::question type="MSQ" question="Which of the following languages over the alphabet $\{0,1\}$ are not recognized by a DFA with three states?" options=["Words which do not have $11$ as a contiguous subword","Binary representations of multiples of three","Words that have $11$ as a suffix","Words that do not contain $101$ as a contiguous subword"] answer="Words that do not contain $101$ as a contiguous subword" hint="For each language, try to construct a minimal DFA. If it requires more than three states, then it cannot be recognized by a 3-state DFA." solution="Let's analyze each option:

1. Words which do not have $11$ as a contiguous subword:

States needed:

q_0

: No

11

seen, last char not

1

.
-

q_1

: No

11

seen, last char was

1

.
-

q_f

11

seen (trap state, non-accepting).

This DFA requires 3 states. The accepting states are $q_0, q_1$ .

Example: $q_0 \xrightarrow{0} q_0$ , $q_0 \xrightarrow{1} q_1$ . $q_1 \xrightarrow{0} q_0$ , $q_1 \xrightarrow{1} q_f$ . $q_f \xrightarrow{0,1} q_f$ .
- This language can be recognized by a 3-state DFA.
2. Binary representations of multiples of three:
- This is a divisibility by $3$ problem. As shown in previous examples, for divisibility by $N$ , we need $N$ states to track remainders modulo $N$ .
- For $N=3$ , we need $3$ states ( $q_0, q_1, q_2$ for remainders $0, 1, 2$ ).
- This language can be recognized by a 3-state DFA.
3. Words that have $11$ as a suffix:
- States needed:
- $q_0$ : Initial state, no $1$ or $11$ suffix.
- $q_1$ : Last char was $1$ .
- $q_2$ : Last two chars were $11$ . (Accepting state)
- Example: $q_0 \xrightarrow{0} q_0$ , $q_0 \xrightarrow{1} q_1$ . $q_1 \xrightarrow{0} q_0$ , $q_1 \xrightarrow{1} q_2$ . $q_2 \xrightarrow{0} q_0$ , $q_2 \xrightarrow{1} q_2$ .
  - This DFA requires 3 states.
  - This language can be recognized by a 3-state DFA.
  4. Words that do not contain $101$ as a contiguous subword:
  - To recognize this, we need to track prefixes of $101$ that have been seen: $\varepsilon$ , $1$ , $10$ , $101$ .
  - States:
  - $q_\varepsilon$ : No prefix of $101$ seen.
  - $q_1$ : Last seen was $1$ .
  - $q_{10}$ : Last seen was $10$ .
  - $q_{101}$ : $101$ seen (trap state, non-accepting).
  - This requires 4 states. Let's verify.
  - $\delta(q_\varepsilon, 0) = q_\varepsilon$ , $\delta(q_\varepsilon, 1) = q_1$
  - $\delta(q_1, 0) = q_{10}$ , $\delta(q_1, 1) = q_1$ (reset to $1$ prefix)
  - $\delta(q_{10}, 0) = q_\varepsilon$ , $\delta(q_{10}, 1) = q_{101}$
  - $\delta(q_{101}, 0) = q_{101}$ , $\delta(q_{101}, 1) = q_{101}$
  - The accepting states would be $\{q_\varepsilon, q_1, q_{10}\}$ .
  - This minimal DFA requires 4 states. Therefore, it cannot be recognized by a 3-state DFA.
  The correct option is 'Words that do not contain $101$ as a contiguous subword'."
  :::
  
  ---
  
  6. DFA Operations: Complement, Union, and Intersection
  
  Regular languages are closed under complementation, union, and intersection. This means if $L_1$ and $L_2$ are regular languages, then so are $L_1^c$ , $L_1 \cup L_2$ , and $L_1 \cap L_2$ . We can construct DFAs for these operations.
  
  Complement:
  Given a DFA $M = (Q, \Sigma, \delta, q_0, F)$ for $L(M)$ , a DFA $M^c$ for $L(M)^c = \Sigma^* \setminus L(M)$ is simply $M^c = (Q, \Sigma, \delta, q_0, Q \setminus F)$ . We swap the final and non-final states.
  
  Union and Intersection (Product Construction):
  Given two DFAs $M_1 = (Q_1, \Sigma, \delta_1, q_{01}, F_1)$ and $M_2 = (Q_2, \Sigma, \delta_2, q_{02}, F_2)$ , we can construct a DFA $M = (Q, \Sigma, \delta, q_0, F)$ for $L(M_1) \cup L(M_2)$ or $L(M_1) \cap L(M_2)$ using the product construction.
  - $Q = Q_1 \times Q_2 = \{(q_i, q_j) \mid q_i \in Q_1, q_j \in Q_2\}$
  - $q_0 = (q_{01}, q_{02})$
  - $\delta((q_i, q_j), a) = (\delta_1(q_i, a), \delta_2(q_j, a))$
  - For Union ( $L(M_1) \cup L(M_2)$ ): $F = \{(q_i, q_j) \mid q_i \in F_1 \text{ or } q_j \in F_2\}$
  - For Intersection ( $L(M_1) \cap L(M_2)$ ): $F = \{(q_i, q_j) \mid q_i \in F_1 \text{ and } q_j \in F_2\}$
  Worked Example: Construct a DFA for the complement of $L(M)$ , where $M$ is the DFA from Section 2 (basic pattern recognition, $L = \{w \in \{a,b\}^* \mid w \text{ contains } ab \text{ as a substring}\})$ .
  $M = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_2\})$ with transitions:
  $\delta(q_0, a) = q_1, \delta(q_0, b) = q_0$
  $\delta(q_1, a) = q_1, \delta(q_1, b) = q_2$
  $\delta(q_2, a) = q_2, \delta(q_2, b) = q_2$
  
  Step 1: Identify the original set of states $Q$ and final states $F$ .
  $Q = \{q_0, q_1, q_2\}$ , $F = \{q_2\}$ .
  
  Step 2: The complement DFA $M^c$ will have the same states, alphabet, transition function, and start state. Only the final states change.
  $F^c = Q \setminus F = \{q_0, q_1\}$ .
  
  Answer: The DFA for $L(M)^c$ is $M^c = (\{q_0, q_1, q_2\}, \{a, b\}, \delta, q_0, \{q_0, q_1\})$ , where $\delta$ is identical to that of $M$ . This DFA accepts strings that do not contain $ab$ as a substring.
  
  :::question type="MCQ" question="Let $L_1$ be the language of strings over $\{0,1\}$ with an even number of $0$ s, and $L_2$ be the language of strings over $\{0,1\}$ with an even number of $1$ s. What is the minimum number of states in a DFA that accepts $L_1 \cap L_2$ ?" options=["2","3","4","5"] answer="4" hint="Construct a DFA for $L_1$ and $L_2$ separately, then use the product construction for their intersection. The minimum number of states will be the size of the product automaton (if it's already minimal)." solution="Step 1: Construct DFA $M_1$ for $L_1$ (even number of $0$ s).
  - States: $Q_1 = \{q_{0E}, q_{0O}\}$ ( $0$ s are Even, $0$ s are Odd).
  - Start state: $q_{0E}$ . Final state: $F_1 = \{q_{0E}\}$ .
  - Transitions $\delta_1$ :
  - $\delta_1(q_{0E}, 0) = q_{0O}$
  - $\delta_1(q_{0E}, 1) = q_{0E}$
  - $\delta_1(q_{0O}, 0) = q_{0E}$
  - $\delta_1(q_{0O}, 1) = q_{0O}$
  This DFA has 2 states.
  
  Step 2: Construct DFA $M_2$ for $L_2$ (even number of $1$ s).
  - States: $Q_2 = \{q_{1E}, q_{1O}\}$ ( $1$ s are Even, $1$ s are Odd).
  - Start state: $q_{1E}$ . Final state: $F_2 = \{q_{1E}\}$ .
  - Transitions $\delta_2$ :
  - $\delta_2(q_{1E}, 0) = q_{1E}$
  - $\delta_2(q_{1E}, 1) = q_{1O}$
  - $\delta_2(q_{1O}, 0) = q_{1O}$
  - $\delta_2(q_{1O}, 1) = q_{1E}$
  This DFA has 2 states.
  
  Step 3: Use product construction for $L_1 \cap L_2$ .
  - States $Q = Q_1 \times Q_2 = \{(q_{0E}, q_{1E}), (q_{0E}, q_{1O}), (q_{0O}, q_{1E}), (q_{0O}, q_{1O})\}$ .
  - Start state: $(q_{0E}, q_{1E})$ .
  - Final states: $F = \{(q, r) \mid q \in F_1 \text{ and } r \in F_2\} = \{(q_{0E}, q_{1E})\}$ .
  - Transitions $\delta((q_a, q_b), \text{sym}) = (\delta_1(q_a, \text{sym}), \delta_2(q_b, \text{sym}))$ .
  - $\delta((q_{0E}, q_{1E}), 0) = (q_{0O}, q_{1E})$
  - $\delta((q_{0E}, q_{1E}), 1) = (q_{0E}, q_{1O})$
  - ... (all 8 transitions will be defined)
  
  Step 4: The product DFA has $2 \times 2 = 4$ states. Since all states are reachable and necessary to distinguish the different parities of $0$ s and $1$ s, this DFA is minimal.
  For example, $(q_{0E}, q_{1E})$ means both counts are even. $(q_{0O}, q_{1E})$ means $0$ s are odd, $1$ s are even. These are distinct conditions.
  
  Thus, the minimum number of states is 4."
  :::
  
  ---
  
  Advanced Applications
  
  Worked Example: Algorithm to check if a DFA $A$ accepts some word that extends a fixed word $u$ . (PYQ 9 adapted)
  A word $v$ extends $u$ if $v = xuy$ for some $x, y \in \Sigma^$ . We need to determine if $L(A) \cap (\Sigma^$ u \Sigma^*) $L (A) \cap (Σ^{*} u Σ^{*})$ is non-empty.
  
  Step 1: Compute the set of states reachable from the initial state of $A$ by any path. Let this be $R$ .
  This can be done using a graph traversal algorithm (e.g., BFS or DFS) starting from the initial state $q_0$ .
  
  Step 2: Compute the set of states from which a final state of $A$ is reachable by any path. Let this be $S$ .
  This can be done by constructing the reverse DFA $A^R$ (reversing all transitions and swapping start/final states), then finding states reachable from its start states (which are $F$ of $A$ ).
  
  Step 3: For every state $r \in R$ and every state $s \in S$ , check if there is a path labeled exactly $u$ from $r$ to $s$ .
  To do this, for each $r \in R$ , compute $\delta^*(r, u)$ . Let this be $r'$ . Then check if $r' \in S$ .
  
  Step 4: If such an $r \in R$ and $s \in S$ (where $s = \delta^*(r,u)$ ) exist, output 'Yes'. Otherwise, output 'No'.
  
  Justification:
  - If 'Yes', then there's a path $q_0 \xrightarrow{x} r \xrightarrow{u} s \xrightarrow{y} q_f$ (where $q_f \in F$ ). Thus $xuy \in L(A)$ .
    - If 'No', then no such path exists, meaning no word of the form $xuy$ is accepted by $A$ .
    Answer: The algorithm involves three reachability computations: forward from $q_0$ , backward from $F$ , and then specific path traversal for $u$ between the reachable/co-reachable states.
    
    :::question type="NAT" question="Consider the language $L = \{w \in \{0,1\}^* \mid w \text{ contains an odd number of } 0\text{s and ends with } 1\}$ . What is the minimum number of states in a DFA for $L$ ?" answer="4" hint="This is a combination of two properties: parity of $0$ s and suffix. Use states to track the combination of these two pieces of information. Ensure all combinations are distinct and reachable." solution="Step 1: Identify the properties to track.
  - Parity of $0$ s: Even ( $E_0$ ) or Odd ( $O_0$ ).
  - Last symbol: $0$ or $1$ . (We only care if it ends in $1$ , so we need to know if the last symbol was $0$ or $1$ ).
  - Number of $0$ s is even.
  - Total length of the string is odd.
  - New start state $q_0$ .
  - $\varepsilon$ -transitions from $q_0$ to the start states of $M_1$ ( $A$ ) and $M_2$ ( $C$ ).
  - The final states of the new NFA are the union of final states of $M_1$ and $M_2$ .

Finite Automata

Finite Automata

Chapter Contents

| Topic |

Part 1: Deterministic Finite Automata (DFA)

Core Concepts

1. Formal Definition of a DFA

2. DFA Construction: Basic Pattern Recognition

3. DFA Construction: Divisibility Problems

4. DFA Construction: Complex Pattern Tracking

5. DFA State Minimization and Language Complexity

6. DFA Operations: Complement, Union, and Intersection

Advanced Applications

Problem-Solving Strategies

Common Mistakes

Practice Questions

Summary

| Formula/Concept | Expression |

What's Next?

Part 2: Non-deterministic Finite Automata (NFA)

Core Concepts

1. Definition of NFA

2. NFA Transitions and Acceptance

3. NFA with ε\varepsilonε-Transitions (ε\varepsilonε-NFA)

Advanced Applications

1. Constructing NFAs for Regular Languages

2. Equivalence of NFA and DFA (Subset Construction)

3. Properties of NFAs

4. Variations of Finite Automata (Muller Automata)

5. Analyzing and Modifying NFAs

Problem-Solving Strategies

Common Mistakes

Practice Questions

Summary

| Formula/Concept | Expression |

What's Next?

Part 3: Equivalence of Automata

Core Concepts

1. Equivalence of NFA and DFA

1.1 Subset Construction Algorithm

2. Equivalence of Two DFAs

2.1 Table-Filling Algorithm for Equivalence

3. Equivalence of Regular Expressions and Finite Automata

3.1 Converting Regular Expression to NFA (ε\varepsilonε-NFA)

3.2 Converting Finite Automaton to Regular Expression

Chapter Summary

Chapter Review Questions

What's Next?

🎯 Key Points to Remember

Related Topics in Formal Languages and Automata Theory

Pushdown Automata (PDA)

Properties of Regular Languages

Introduction to Formal Languages

Turing Machines and Decidability

More Resources

Study Notes

Short Notes

Test Series

Mock Tests

Previous Year Papers

Chapter-wise PYQs

Chapter Practice

Why Choose MastersUp?

AI-Powered Plans

15,000+ Questions

Smart Analytics

Bookmark & Revise

3. NFA with $\varepsilon$ -Transitions ( $\varepsilon$ -NFA)

3.1 Converting Regular Expression to NFA ( $\varepsilon$ -NFA)