Context-Free Languages

Welp, we’ve proven a lot of languages aren’t regular. Any language that requires unbounded counting cannot be regular, since recognizing it takes more than a constant amount of state.

The class of context-free languages is an intermediate class between the class of regular languages and the class of decidable languages: it is a strict superset of the regular languages and a strict subset of the decidable languages

Context-free languages are described by context-free grammars
They are recognized by nondeterministic pushdown automata (N-PDA); the less powerful deterministic pushdown automata (D-PDA) recognize only a subclass

Instead of writing out a system to recognize languages, we make a system to generate strings inductively. To generate the language $\{a^{n} b^{n} ∣ n \geq 0\}$ , we start with the basis that $ϵ$ is in the language, and use the recursive step " $w \in A \to a w b \in A$ ".

A grammar is a set of substitution rules for recursively generating strings

Definition: A context-free grammar is a $4$ -tuple $(V, Σ, R, S)$ , where:

$V$ is a finite set called the variables
$Σ$ is a finite set, disjoint from $V$ , called the terminals
$R$ is a finite set of rules taking the form $v \to$ a string of variables and terminals
$S \in V$ is the start symbol

Example: The language $\{a^{n} b^{n} ∣ n \geq 0\}$
$V = \{S\}$ , $Σ = \{a, b\}$ , $R = \{S \to a S b ∣ ϵ\}$ , start symbol $S$
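To make the generative view concrete, here is a minimal Python sketch that enumerates the short strings of this grammar by always expanding the leftmost variable. The dictionary encoding of rules and the name `generate` are assumptions made for illustration. The length cut-off only guarantees termination for grammars like this one, where every non-$ϵ$ rule adds terminals.

```python
from collections import deque

def generate(rules, start, max_len):
    """Enumerate terminal strings with at most max_len symbols.

    rules maps each variable to a list of right-hand sides (tuples of symbols);
    any symbol that is not a key of rules is treated as a terminal.
    """
    results = set()
    seen = {(start,)}
    queue = deque([(start,)])
    while queue:
        form = queue.popleft()
        # Position of the leftmost variable, or None if the form is all terminals.
        idx = next((i for i, sym in enumerate(form) if sym in rules), None)
        if idx is None:
            results.add("".join(form))
            continue
        for rhs in rules[form[idx]]:
            new_form = form[:idx] + rhs + form[idx + 1:]
            terminals = sum(1 for sym in new_form if sym not in rules)
            if terminals <= max_len and new_form not in seen:
                seen.add(new_form)
                queue.append(new_form)
    return sorted(results, key=lambda w: (len(w), w))

# S -> a S b | ϵ  (the empty tuple encodes ϵ)
rules = {"S": [("a", "S", "b"), ()]}
strings = generate(rules, "S", 6)
print(strings)
```

Each string found this way corresponds to one leftmost derivation, such as $S ⇒ aSb ⇒ aaSbb ⇒ aabb$ .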

We may apply rules recursively however we’d like until all symbols are terminal. This is very powerful.

A derivation of a string is leftmost if we always expand the leftmost variable. Two derivations count as different $⟺$ their corresponding leftmost (or rightmost) derivations are different, so merely reordering independent expansions does not give a new derivation.

A parse tree is a tree encoding the steps in a derivation, where leaves are labeled by a terminal or $ϵ$ , interior nodes are labeled by a variable, and the root is the start symbol. This representation does not depend on the order in which the variables were expanded. Reading the leaves from left to right (for example, with a pre-order traversal) gives the derived string.

Theorem: For every parse tree, there is a unique leftmost and a unique rightmost derivation

On the topic of ambiguous grammars, Antonella is very amused by “Let’s eat_,_ Grandma”

A grammar is ambiguous if it can yield a string in multiple ways (uniqueness defined as above), or in other words, if there is a string in the language with two different leftmost (or rightmost) derivations. We can make ambiguous grammars for any context-free language, but many have good unambiguous grammars. However, some languages are inherently ambiguous (for example, $\{a^{i} b^{j} c^{k} ∣ i = j \text{ or } j = k\}$ ).

Example:
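$S \to S + S ∣ S \cdot S ∣ s$ is ambiguous (for example, $s + s \cdot s$ has two parse trees)
$S \to S + s ∣ S \cdot s ∣ s$ is not (every expression is forced to group to the left)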
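One way to see the difference is to count parse trees. The sketch below is a hedged illustration, not a general parser: it assumes both grammars also contain the base rule $S \to s$ (without it neither generates any string), writes `*` for $\cdot$ , and only works for grammars without $ϵ$ -rules or unit rules between variables. The function name and rule encoding are made up for this example.

```python
from functools import lru_cache

def count_parse_trees(rules, start, text):
    """Count parse trees of text; assumes no ϵ-rules and no variable-to-variable unit rules."""

    @lru_cache(maxsize=None)
    def derive(symbol, i, j):
        # Number of ways symbol can derive text[i:j].
        if symbol not in rules:  # terminal symbol
            return 1 if j == i + 1 and text[i] == symbol else 0
        return sum(match(rhs, i, j) for rhs in rules[symbol])

    @lru_cache(maxsize=None)
    def match(rhs, i, j):
        # Number of ways the symbol sequence rhs can derive text[i:j].
        if not rhs:
            return 1 if i == j else 0
        head, rest = rhs[0], rhs[1:]
        total = 0
        # head covers text[i:k]; every remaining symbol needs at least one character.
        for k in range(i + 1, j - len(rest) + 1):
            total += derive(head, i, k) * match(rest, k, j)
        return total

    return derive(start, 0, len(text))

ambiguous = {"S": [("S", "+", "S"), ("S", "*", "S"), ("s",)]}
unambiguous = {"S": [("S", "+", "s"), ("S", "*", "s"), ("s",)]}

print(count_parse_trees(ambiguous, "S", "s+s*s"))    # 2
print(count_parse_trees(unambiguous, "S", "s+s*s"))  # 1
```

The two trees found for the first grammar correspond to grouping the string as $(s + s) \cdot s$ or as $s + (s \cdot s)$ .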

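In Chomsky Normal Form, every rule takes the form of $A \to BC$ or $A \to a$ (the rule $S_{0} \to ϵ$ is also allowed for the start symbol, so that the language may contain $ϵ$ )
To convert a CFG to CNF,

1. Add a new start symbol $S_{0}$ with the rule $S_{0} \to S$
2. Determine the nullable variables and remove the $ϵ$ -rules
3. Eliminate unit productions ( $A \to B$ )
4. Replace long productions with chains of two-symbol productions
5. Replace each terminal $a$ inside a two-symbol production with a new variable $U_{a}$ and add the rule $U_{a} \to a$

Swapping steps 4 and 5 works too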
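Finding the nullable variables is a small fixed-point computation. The sketch below shows one way to do it; the example grammar and the name `nullable_variables` are assumptions chosen for illustration, not part of the original notes.

```python
def nullable_variables(rules):
    """Return the set of variables that can derive ϵ.

    rules maps each variable to a list of right-hand sides (tuples of symbols);
    the empty tuple encodes an ϵ-rule.
    """
    nullable = set()
    changed = True
    while changed:
        changed = False
        for variable, bodies in rules.items():
            if variable in nullable:
                continue
            # A variable is nullable if some body consists only of nullable symbols
            # (this includes the empty body, since all() of nothing is True).
            if any(all(sym in nullable for sym in body) for body in bodies):
                nullable.add(variable)
                changed = True
    return nullable

# Hypothetical grammar: S -> A B | C,  A -> a A | ϵ,  B -> A A | b,  C -> c
rules = {
    "S": [("A", "B"), ("C",)],
    "A": [("a", "A"), ()],
    "B": [("A", "A"), ("b",)],
    "C": [("c",)],
}
nullable = nullable_variables(rules)
print(sorted(nullable))
```

Here $A$ is nullable directly, $B$ through $A A$ , and $S$ through $A B$ , while $C$ is not.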

What closure properties can we rely on for context-free languages?

Union
Concatenation
Star
~~Intersection~~
~~Complement~~
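The three positive closure properties follow from simple grammar constructions. As a sketch, assume grammars for the two languages with disjoint variable sets and start symbols $S_{1}$ and $S_{2}$ (names introduced here for illustration), and add a fresh start symbol $S$ :

Union: $S \to S_{1} ∣ S_{2}$
Concatenation: $S \to S_{1} S_{2}$
Star: $S \to S S_{1} ∣ ϵ$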

Pushdown Automata

PDAs are equivalent in power to CFGs; however, they recognize a context-free language instead of generating it

They are similar to NFAs with the addition of a stack

Definition: A pushdown automaton is a 6-tuple ( $Q, Σ, Γ, δ, q_{0}, F$ ), where $Q, Σ, Γ,$ and $F$ are all finite sets, and

$Q$ is the set of states,
$Σ$ is the input alphabet,
$Γ$ is the stack alphabet,
$δ : Q \times Σ_{ϵ} \times Γ_{ϵ} \to \mathcal{P} (Q \times Γ_{ϵ})$ is the transition function (with $Σ_{ϵ} = Σ \cup \{ϵ\}$ and $Γ_{ϵ} = Γ \cup \{ϵ\}$ ),
$q_{0} \in Q$ is the start state, and
$F \subseteq Q$ is the set of accept states.
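The definition can be exercised with a small simulator. The following is a minimal sketch, assuming a dictionary encoding of $δ$ and a breadth-first search over configurations; the names `pda_accepts` and `max_steps` and the state names are hypothetical. The example machine is a standard PDA for $\{a^{n} b^{n} ∣ n \geq 0\}$ that pushes a bottom marker `$`, pushes each $a$ , pops one $a$ per $b$ , and accepts once the marker is popped.

```python
from collections import deque

EPS = ""  # stands in for ϵ

def pda_accepts(delta, start, accepting, text, max_steps=100_000):
    """Search configurations (state, input position, stack) breadth-first.

    delta maps (state, input symbol or EPS, popped symbol or EPS)
    to a set of (next state, pushed symbol or EPS).
    """
    queue = deque([(start, 0, ())])  # the stack is a tuple with its top at the end
    seen = set()
    for _ in range(max_steps):       # guard against endless ϵ-moves
        if not queue:
            return False
        config = queue.popleft()
        if config in seen:
            continue
        seen.add(config)
        state, pos, stack = config
        if pos == len(text) and state in accepting:
            return True
        reads = [EPS] + ([text[pos]] if pos < len(text) else [])
        pops = [EPS] + ([stack[-1]] if stack else [])
        for a in reads:
            for x in pops:
                for new_state, push in delta.get((state, a, x), ()):
                    new_stack = stack[:-1] if x else stack
                    if push:
                        new_stack += (push,)
                    queue.append((new_state, pos + len(a), new_stack))
    return False

delta = {
    ("q1", EPS, EPS): {("q2", "$")},  # push the bottom marker
    ("q2", "a", EPS): {("q2", "a")},  # push one a per input a
    ("q2", "b", "a"): {("q3", EPS)},  # first b pops an a
    ("q3", "b", "a"): {("q3", EPS)},  # each further b pops an a
    ("q3", EPS, "$"): {("q4", EPS)},  # marker on top means the counts matched
}
accepting = {"q1", "q4"}

for word in ["", "ab", "aabb", "aab", "abb", "ba"]:
    print(repr(word), pda_accepts(delta, "q1", accepting, word))
```

The search explores every nondeterministic branch, which is fine for small examples but is not how one would implement a parser in practice.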

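Some CFLs are more easily described by a grammar, and others by an automaton

We can also derive relationships between CFLs and regular languages (RL). For example, the intersection of a CFL and an RL is always a CFL. To see this, consider taking the product of a pushdown automaton with a DFA: the product tracks the states of both machines, while the single stack still belongs to the PDA.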

Theorem: A language is context free $⟺$ some pushdown automaton recognizes it.

It’s not too hard to construct a PDA that mimics a CFG. The other way around is slightly trickier though.

Non-Context-Free Languages

Turns out there are also limitations to context-free languages

There is a pumping lemma for context-free languages, stating that all sufficiently long strings can be “pumped”, which in this context means dividing the string into five parts so that the second and fourth may be repeated together any number of times

Theorem: If $A$ is a context-free language, $\exists p$ (the pumping length) such that every $s \in A$ with $∣ s ∣ \geq p$ can be divided as $s = u v x y z$ where $\forall i \geq 0$ , $u v^{i} x y^{i} z \in A$ , $∣ v y ∣ > 0$ , and $∣ v x y ∣ \leq p$

This takes slightly more advanced reasoning than the pumping lemma for regular languages, but ultimately uses the pigeon-hole principle in a similar way: a long enough string forces a parse tree tall enough that some variable repeats along a root-to-leaf path
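To get a feel for how the lemma is used, the sketch below brute-forces every division $s = u v x y z$ allowed by the conditions and keeps those that stay in the language for a few test exponents. The function name and the choice of exponents $0$ and $2$ are assumptions for illustration. An empty result for a fixed $p$ shows that this particular $s$ cannot be pumped; a non-empty result does not by itself prove anything, since only a few exponents are tried.

```python
def surviving_splits(s, p, member, exponents=(0, 2)):
    """Return every split (u, v, x, y, z) of s with |vy| > 0 and |vxy| <= p
    whose pumped strings are in the language for all tested exponents."""
    survivors = []
    n = len(s)
    for i in range(n + 1):                     # v starts at index i
        for l in range(i, min(i + p, n) + 1):  # vxy = s[i:l], so |vxy| <= p
            for j in range(i, l + 1):          # v = s[i:j]
                for k in range(j, l + 1):      # x = s[j:k], y = s[k:l]
                    u, v, x, y, z = s[:i], s[i:j], s[j:k], s[k:l], s[l:]
                    if not (v or y):
                        continue               # the lemma requires |vy| > 0
                    if all(member(u + v * e + x + y * e + z) for e in exponents):
                        survivors.append((u, v, x, y, z))
    return survivors

def in_anbncn(w):
    n = len(w) // 3
    return w == "a" * n + "b" * n + "c" * n

def in_anbn(w):
    n = len(w) // 2
    return w == "a" * n + "b" * n

print(surviving_splits("aaabbbccc", 3, in_anbncn))    # no split survives
print(len(surviving_splits("aaabbb", 3, in_anbn)) > 0)  # some split survives
```

For $a^{3} b^{3} c^{3}$ with $p = 3$ the window $v x y$ is too short to touch all three letters, so removing $v$ and $y$ always unbalances the counts. The same argument with $s = a^{p} b^{p} c^{p}$ for arbitrary $p$ is the usual proof that $\{a^{n} b^{n} c^{n}\}$ is not context free.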

Deterministic CFL

Deterministic PDAs are actually weaker than the PDAs we’ve examined so far. We still allow the use of $ϵ$ transitions but there can only ever be one valid transition at a time.

Definition: A deterministic pushdown automaton is a 6-tuple ( $Q, Σ, Γ, δ, q_{0}, F$ ), where $Q, Σ, Γ,$ and $F$ are all finite sets, and

$Q$ is the set of states,
$Σ$ is the input alphabet,
$Γ$ is the stack alphabet,
$δ : Q \times Σ_{ϵ} \times Γ_{ϵ} \to (Q \times Γ_{ϵ}) \cup \{\emptyset\}$ is the transition function,
$q_{0} \in Q$ is the start state, and
$F \subseteq Q$ is the set of accept states.

For every $q \in Q$ , $a \in Σ$ , and $x \in Γ$ , exactly one of the values $δ (q, a, x), δ (q, a, ϵ), δ (q, ϵ, x), δ (q, ϵ, ϵ)$ is not $\emptyset$ .
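This condition is mechanical to check. A minimal sketch, assuming the transition function is stored as a dictionary in which a missing key plays the role of $\emptyset$ (the encoding and the name `is_deterministic` are illustrative choices):

```python
EPS = ""  # stands in for ϵ

def is_deterministic(delta, states, sigma, gamma):
    """Check that for every (q, a, x) exactly one of the four candidate moves is defined."""
    for q in states:
        for a in sigma:
            for x in gamma:
                candidates = [(q, a, x), (q, a, EPS), (q, EPS, x), (q, EPS, EPS)]
                defined = sum(1 for key in candidates if key in delta)
                if defined != 1:
                    return False
    return True

# A one-state toy machine that pushes an x for every a it reads.
good = {("q", "a", EPS): ("q", "x")}

# Adding an ϵ-move gives the machine a choice in the same situation.
bad = dict(good)
bad[("q", EPS, EPS)] = ("q", EPS)

print(is_deterministic(good, {"q"}, {"a"}, {"x"}))  # True
print(is_deterministic(bad, {"q"}, {"a"}, {"x"}))   # False
```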

Acceptance occurs as in PDAs. If a DPDA enters an accept state after it has read the last input symbol of an input string, it accepts that string. Rejection occurs if the DPDA doesn’t arrive at an accept state at the end, tries to pop an empty stack (hangs), or makes an endless sequence of $ϵ$ -input moves (loops).

Definition: The language of a DPDA is called a deterministic context-free language

To reason more easily about DPDAs, we can first convert any DPDA into one that reads the entire input string.

Theorem: The class of DCFLs is closed under complementation

To see this, we modify a DPDA so that it decides acceptance at most once per input symbol (by splitting states into reading and popping states, and letting only reading states accept). Then we invert acceptance on the reading states.

This theorem implies that some CFLs are not DCFLs. If a CFL’s complement is not a CFL then that CFL is not a DCFL (think $A = \{a^{i} b^{j} c^{k} ∣ i \neq j \text{ or } j \neq k\}$ )

DCFLs are not closed under union, intersection, star, or reversal

As a useful tool, we use the fact that $A$ is a DCFL $⟺$ $A ⊣$ (every string of $A$ followed by an endmarker $⊣$ ) is a DCFL

Deterministic pushdown automata are equivalent in power to deterministic context-free grammars, provided we restrict our attention to endmarked languages

Deterministic CFGs are defined in terms of reductions instead of “generations”. If $u$ and $v$ are strings of variables and terminals, we write $u ↣ v$ to mean $v$ can be obtained from $u$ via a reduce step (a rule applied in reverse). $u ↣^{*} v$ means $v$ can be obtained from $u$ through a series of reduce steps.

In a CFG with start variable $S$ and string $w$ , we write a leftmost reduction of $w$ as $w = u_{1} ↣ u_{2} ↣ \dots ↣ u_{k} = S$ . We say that $w$ determines its entire leftmost reduction, which implies the grammar is unambiguous. This is an important but not sufficient requirement.

If $u_{i} = x h y ↣ x T y = u_{i + 1}$ using the rule $T \to h$ , then $h$ (together with that rule) is a handle of $u_{i}$
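As a small worked example, assume the grammar $S \to a S b ∣ ab$ (picked here only for illustration). The string $aabb$ has the leftmost reduction

$aabb ↣ a S b ↣ S$

The first step reduces the handle $ab$ with $x = a$ and $y = b$ using $S \to ab$ ; the second reduces the handle $a S b$ with $x = y = ϵ$ using $S \to a S b$ .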

Example: Consider
$R \to S ∣ T$
$S \to a S b ∣ ab$
$T \to a T bb ∣ abb$

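This is the language $\{a^{m} b^{m} ∣ m \geq 1\} \cup \{a^{m} b^{2 m} ∣ m \geq 1\}$

It should be clear that this is not a DCFL, since a DPDA cannot know while reading the $a$ s whether to match each one with one $b$ or two
But if we use $R \to 1 S ∣ 2 T$ , so that a leading marker announces which case applies, then it is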

The key idea is we need handles to be restricted so that a DPDA can identify them easily. As in, one must be able to find the handle without knowing the reduce step in advance.

We introduce the requirement that the reduce step is uniquely determined by the prefix of $u_{i}$ up through and including the reducing string $h$ of that reduce step. In other words, we say that $h$ is a forced handle of a valid string $x h y$ if $h$ is the unique handle in every valid string $x h \hat{y}$ with $\hat{y} \in Σ^{*}$

Definition: A deterministic context-free grammar is a context-free grammar such that every valid string has a forced handle

For simplicity, we say $S$ isn’t on the right-hand side of any rule, and that there are no unused variables

This is a good definition but doesn’t make it obvious how to determine whether a CFG is deterministic

The DK-test says that for any CFG we can construct an associated DFA $DK$ to identify handles

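Each state of the DFA is a set of dotted rules representing our matching progress for each rule. We include transitions for each terminal and variable. When the dot sits in front of a variable, we expand it by each of that variable’s rules. The accept states are those containing a completed rule (dot at the end).

Example of a $DK$ automaton that fails the test: [figure]

Example of a $DK$ automaton that passes the test: [figure] (note the bottom right box should be $T \to T (T) .$ )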

We have a DCFG if each accept state has,

Exactly one completed rule
No dotted rule in which a terminal symbol immediately follows the dot

Note our definition of a forced handle.

At each step, we run the current string through the machine until it lands in an accept state, which marks the end of a handle. If an accept state has multiple completed rules then we wouldn’t know which one to apply. If an accept state has a dotted rule with a terminal after the dot then the input could continue on toward a different handle, and the choice would depend on symbols we haven’t read yet. However, a variable after the dot is harmless: in a leftmost reduction everything after the handle is terminals, so that transition can never follow a real handle.
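The two conditions can be checked by building the dotted-rule states directly. The following is a minimal sketch, not a full parser generator: the rule encoding, the function name `dk_test`, the use of `#` in place of $⊣$ , and both example grammars are assumptions chosen for illustration.

```python
def dk_test(rules, start):
    """Build the DK automaton's reachable states and check the two accept-state conditions.

    rules maps each variable to a list of right-hand sides (tuples of symbols).
    A dotted rule is encoded as (variable, right-hand side, dot position).
    """
    def closure(items):
        # Whenever the dot precedes a variable, add that variable's rules with the dot at 0.
        items = set(items)
        todo = list(items)
        while todo:
            _, rhs, dot = todo.pop()
            if dot < len(rhs) and rhs[dot] in rules:
                for body in rules[rhs[dot]]:
                    item = (rhs[dot], body, 0)
                    if item not in items:
                        items.add(item)
                        todo.append(item)
        return frozenset(items)

    def goto(state, symbol):
        # Move the dot across symbol in every dotted rule that allows it.
        return closure({(l, r, d + 1) for l, r, d in state if d < len(r) and r[d] == symbol})

    first = closure({(start, body, 0) for body in rules[start]})
    seen, todo = {first}, [first]
    while todo:
        state = todo.pop()
        completed = [item for item in state if item[2] == len(item[1])]
        if completed:  # accept state: it contains at least one completed rule
            terminal_after_dot = any(d < len(r) and r[d] not in rules for _, r, d in state)
            if len(completed) != 1 or terminal_after_dot:
                return False
        for symbol in {r[d] for _, r, d in state if d < len(r)}:
            nxt = goto(state, symbol)
            if nxt not in seen:
                seen.add(nxt)
                todo.append(nxt)
    return True

# S -> T #,  T -> T ( T ) | ϵ
passing = {"S": [("T", "#")], "T": [("T", "(", "T", ")"), ()]}
# S -> E #,  E -> E + T | T,  T -> T * a | a
failing = {"S": [("E", "#")], "E": [("E", "+", "T"), ("T",)], "T": [("T", "*", "a"), ("a",)]}

print(dk_test(passing, "S"))  # True
print(dk_test(failing, "S"))  # False
```

The second grammar fails because reading $T$ from the start state reaches a state holding both the completed rule $E \to T .$ and the dotted rule $T \to T . * a$ , where a terminal follows the dot.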

Any DCFG can be converted to a DPDA, but only endmarked languages can be converted from a DPDA to a DCFG

Theorem: An endmarked language is generated by a DCFG $⟺$ it is a DCFL

To make a DPDA for a DCFG, we essentially simulate $DK$ , keeping its states on the stack, and using that to iteratively reduce any input string. Once we reduce to the start variable $S$ , we accept.

Going the other way around is much more complicated and involves the $DK$ -test

LR(K) Grammars

DCFGs are great but can be a bit inconvenient with their restrictions

$L R (k)$ grammars are a broader class of grammars, close to DCFGs, that can be directly converted into DPDAs

Definition: In an $L R (k)$ grammar, a handle is forced according to the $k$ symbols following the handle

The name spells out how such a grammar is parsed: left-to-right input processing (L), rightmost derivations (R), and $k$ symbols of lookahead

Therefore, a DCFG is the same as an $L R (0)$ grammar

We can convert any $L R (k)$ grammar to a DPDA with a $D K_{k}$ test
A $D K_{k}$ test works much like a $DK$ test, but we keep track of the possible length- $k$ strings that could follow a handle after reducing. This lets us track ambiguities in a procedural way.

Binyamin's Notes
