2620 Notes 1.2 (8/29) | Saahil Gupta

Encoding Problems

Before we can define mathematical computation models(like Turing machines) that correspond to machines, we need a way to encode these problems.

We can observe that an instance of any problem can always be encoded by a sequence of 0s and 1s. Rather, we can use a combination of symbols to encode a problem.

The finite set of symbols and characters used for encoding is called the alphabet $\Sigma$. For example, $\Sigma = {0, 1}$.
A string $w$ over an alphabet $\Sigma$ is a finite sequence of those symbols.
$\Sigma^*$ Represents the set of all strings over $\Sigma$.
A language represents a particular set of strings over an alphabet.

Operations Over Strings

Many operations can be performed on strings to modify them:

Given strings $u$ and $v$, $u.v$ denotes their concatenation.
A string $u$ is a prefix of string $w$ if there exists a string $v$ such that $w = u.v$.
A string $u$ is a suffix of string $w$ if there exists a string $v$ such that $w = v.u$.
A string $u$ is a substring of string $w$ if there exists a string $v$ and $v’$ such that $w = v.u.v’$.

Finite State Automata

Automaton $M$ has a finite number of states. It begins with an initial state, and based on the symbol currently being processed.

Let’s construct an automaton for the language $A = { w$ | $w$ contains an even number of a’s$}$, where $\Sigma = {a, b}$.

Let us define two states, $q_1, q_2$. Start with the initial state, $q_1$. On a given character, if it is $a$, we transition from $q_1 \to q_2$, or $q_2 \to q_1$. Otherwise, we stay in the current state.

If by the end of the string, we are in state $q_1$, then $w$ has an even number of a’s and belongs in $A$. This type of machine is called a deterministic finite automaton, or DFA.

The transitions of this state is represented by a transition function $\delta$. If $M$ reads the symbol $x$ on current state $y$, then the next state will be $\delta(y, x)$.

A language $A$ is called a regular language if it can be solved by a DFA.