CS536 - Spring 2025, University of Wisconsin

For this assignment you will write a name analyzer for bach programs represented as abstract-syntax trees. Your main task will be to write name analysis methods for the nodes of the AST. In addition you will need to:

Specifications

Getting Started

Name Analysis

You must implement your name analyzer by writing appropriate methods for the different subclasses of ASTnode. Exactly what methods you write is up to you (as long as they do name analysis as specified).

It may help to start by writing the name analysis method for ProgramNode, then work "top down", adding a method for DeclListNode (the child of a ProgramNode), then for each kind of DeclNode (except StructDeclNode), and so on (and then handle StructDeclNode and perhaps other struct related nodes at the end). Be sure to think about which nodes' methods need to add a new hashtable to the symbol table (i.e., when is a new scope being entered) and which methods need to remove a hashtable from the symbol table (i.e., when is a scope being exited).
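As a rough illustration of entering and exiting scopes, here is a minimal sketch of a symbol table kept as a list of hashtables with the innermost scope at the front. The class `ScopeStackSketch`, its method names, and the use of plain strings as entries are assumptions made for this example; they are not the provided SymTab or Sym classes.

```java
import java.util.HashMap;
import java.util.LinkedList;

// Hypothetical stand-in for a symbol table: a list of hashtables,
// innermost scope first. Entries are plain strings here, not real Syms.
class ScopeStackSketch {
    private final LinkedList<HashMap<String, String>> scopes = new LinkedList<>();

    ScopeStackSketch() {
        addScope(); // start with one (global) scope
    }

    // Called when a new scope is entered (e.g., at a function body).
    void addScope() {
        scopes.addFirst(new HashMap<>());
    }

    // Called when the current scope is exited.
    void removeScope() {
        scopes.removeFirst();
    }

    // Adds a declaration to the innermost scope.
    // Returns false if the name is already declared in that scope.
    boolean addDecl(String name, String info) {
        HashMap<String, String> current = scopes.getFirst();
        if (current.containsKey(name)) {
            return false;
        }
        current.put(name, info);
        return true;
    }

    // Looks a name up in every scope, innermost first; null if not found.
    String lookupGlobal(String name) {
        for (HashMap<String, String> scope : scopes) {
            String info = scope.get(name);
            if (info != null) {
                return info;
            }
        }
        return null;
    }

    int depth() {
        return scopes.size();
    }
}
```

With this shape, a node that opens a scope calls `addScope()` before processing its children and `removeScope()` afterwards, so an inner declaration hides an outer one only while the inner scope is active.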

Some of the methods will process the declarations in the program (checking for bad declarations and checking whether the names are multiply declared, and if not, adding appropriate symbol-table entries) and some will process the statements in the program (checking that every name used in a statement has been declared and adding links). Note that you should not add a link for an IdNode that represents a use of an undeclared name.
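The rule about links can be sketched as follows. The class `IdUseSketch`, its `link` field, and the `resolve` method are hypothetical names chosen for this example, and `Object` stands in for whatever symbol-table entry type you use; in the real analyzer the failing case would also report `Identifier undeclared`.

```java
import java.util.HashMap;
import java.util.LinkedList;

// Hypothetical model of a use of an identifier in a statement.
class IdUseSketch {
    final String name;
    Object link = null; // stays null if the name is undeclared

    IdUseSketch(String name) {
        this.name = name;
    }

    // Searches the scopes from innermost to outermost. On success the
    // entry is linked and true is returned; on failure no link is added.
    boolean resolve(LinkedList<HashMap<String, Object>> scopes) {
        for (HashMap<String, Object> scope : scopes) {
            Object sym = scope.get(name);
            if (sym != null) {
                link = sym;
                return true;
            }
        }
        return false; // caller would report the undeclared-identifier error
    }
}
```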

struct Handling Issues

Error Reporting

Your name analyzer should find all of the errors described in the table given below; it should report the specified position of the error, and it should give exactly the specified error message (each message should appear on a single line, rather than how it is formatted in the following table). Error messages should have the same format as in the scanner and parser (i.e., they should be issued using a call to ErrMsg.fatal).

If a declaration is both "bad" (e.g., a non-function declared void) and is a declaration of a name that has already been declared in the same scope, you should give two error messages (first the "bad" declaration error, then the "multiply declared" error).

Type of Error	Error Message	Position to Report
More than one declaration of an identifier in a given scope (note: includes identifier associated with a `struct` definition)	`Identifier multiply-declared`	The first character of the ID in the duplicate declaration
Use of an undeclared identifier	`Identifier undeclared`	The first character of the undeclared identifier
Bad `struct` access (LHS of colon-access is not of a `struct` type)	`Colon-access of non-struct type`	The first character of the ID corresponding to the LHS of the colon-access.
Bad `struct` access (RHS of colon-access is not a field of the appropriate `struct`)	`Name of struct field invalid`	The first character of the ID corresponding to the RHS of the colon-access.
Bad declaration (variable or parameter of type `void`)	`Non-function declared void`	The first character of the ID in the bad declaration.
Bad declaration (attempt to declare variable of a bad `struct` type)	`Name of struct type invalid`	The first character of the ID corresponding to the `struct` type in the bad declaration.

Note that the names themselves should not be printed as part of the error messages.

During name analysis, if a function name is multiply declared you should still process the formals and the body of the function; don't add a new entry to the current symbol table for the function, but do add a new hashtable to the front of the SymTab's list for the names declared in the body (i.e., the parameters and other local variables of the function).
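One way to picture the handling of a multiply-declared function is sketched below. `FnDeclSketch` and `declareFunction` are hypothetical names, entries are plain strings, and errors are collected in a list rather than reported through ErrMsg.fatal; treat this as an illustration of the control flow only.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedList;
import java.util.List;

// Hypothetical sketch: function declarations against a list of hashtables.
class FnDeclSketch {
    final LinkedList<HashMap<String, String>> scopes = new LinkedList<>();
    final List<String> errors = new ArrayList<>();

    FnDeclSketch() {
        scopes.addFirst(new HashMap<>()); // global scope
    }

    void declareFunction(String name, List<String> formals) {
        HashMap<String, String> enclosing = scopes.getFirst();
        if (enclosing.containsKey(name)) {
            // Duplicate: report it and add no new entry for the function...
            errors.add("Identifier multiply-declared");
        } else {
            enclosing.put(name, "function");
        }
        // ...but still open a scope for the formals and body either way.
        scopes.addFirst(new HashMap<>());
        for (String formal : formals) {
            HashMap<String, String> local = scopes.getFirst();
            if (local.containsKey(formal)) {
                errors.add("Identifier multiply-declared");
            } else {
                local.put(formal, "formal");
            }
        }
        // The body's declarations and statements would be processed here.
        scopes.removeFirst();
    }
}
```

Because the new scope is opened regardless of whether the function name was accepted, errors inside the formals and body of a duplicate function are still found.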

If you find a bad variable declaration (a variable of type void or of a bad struct type), give an error message and add nothing to the symbol table.
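The rules for bad variable declarations, including the required order of the two messages when a declaration is both bad and a duplicate, might be sketched like this. `VarDeclCheckSketch` and `checkVarDecl` are hypothetical names; the sketch covers only the `void` case (the bad `struct` type check is omitted) and returns the messages instead of calling ErrMsg.fatal with a position.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;

// Hypothetical helper for checking one variable declaration.
class VarDeclCheckSketch {
    // Returns the messages that would be reported, in order. The name is
    // added to the scope only when the declaration has no errors.
    static List<String> checkVarDecl(HashMap<String, String> scope,
                                     String typeName, String id) {
        List<String> errors = new ArrayList<>();
        if (typeName.equals("void")) {
            errors.add("Non-function declared void"); // "bad" error first
        }
        if (scope.containsKey(id)) {
            errors.add("Identifier multiply-declared"); // then the duplicate
        }
        if (errors.isEmpty()) {
            scope.put(id, typeName);
        }
        return errors;
    }
}
```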

Summary

Other Tasks

Extending the Sym Class

It is up to you how you store information in each symbol-table entry (each Sym). To implement the changes to the unparser described below you will need to know each name's type. For function names, this includes the return type and the number of parameters and their types. You can modify the Sym class by adding some new fields (e.g., a kind field) and/or by declaring some subclasses (e.g., a subclass for functions that has extra fields for the return type and the list of parameter types). You will probably also want to add new methods that return the values of the new fields and it may be helpful to change the toString method so that you can print the contents of a Sym for debugging purposes.
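For instance, the subclass approach could look roughly like the sketch below. `SymSketch` and `FnSymSketch` are hypothetical names used to avoid suggesting this is the provided Sym class, types are represented as plain strings, and the `toString` format is an arbitrary choice for debugging, not the format the unparser requires.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical base entry: records a type as a string.
class SymSketch {
    private final String type;

    SymSketch(String type) {
        this.type = type;
    }

    String getType() {
        return type;
    }

    @Override
    public String toString() {
        return type;
    }
}

// Hypothetical entry for functions: the inherited type is the return
// type, and the parameter types are stored in declaration order.
class FnSymSketch extends SymSketch {
    private final List<String> paramTypes;

    FnSymSketch(String returnType, List<String> paramTypes) {
        super(returnType);
        this.paramTypes = Collections.unmodifiableList(new ArrayList<>(paramTypes));
    }

    int getNumParams() {
        return paramTypes.size();
    }

    List<String> getParamTypes() {
        return paramTypes;
    }

    @Override
    public String toString() {
        // Arbitrary debugging format: paramType,paramType->returnType
        return String.join(",", paramTypes) + "->" + getType();
    }
}
```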

Modifying the IdNode Class

P4.java

Calling the name analyzer means calling the appropriate method of the ASTnode that is the root of the tree built by the parser.

Modifying the ErrMsg Class

Your compiler should quit after the name analyzer has finished if any errors have been detected so far (either by the scanner/parser or the name analyzer). To accomplish this, you can add a static boolean field to the ErrMsg class that is initialized to false and is set to true if the fatal method is ever called (warnings should not change the value of this field). Your main program can check the value of this field and only call the unparser if it is false.
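A minimal sketch of the flag is shown below. The class is named `ErrMsgSketch` to make clear it is not the provided ErrMsg class; the method signatures, the `getErr` accessor, and the printed message format are assumptions for illustration.

```java
// Hypothetical stand-in for ErrMsg with an added "error seen" flag.
class ErrMsgSketch {
    private static boolean err = false; // set once fatal is called

    static void fatal(int lineNum, int charNum, String msg) {
        err = true;
        System.err.println(lineNum + ":" + charNum + " ***ERROR*** " + msg);
    }

    static void warn(int lineNum, int charNum, String msg) {
        // Warnings do not change the flag.
        System.err.println(lineNum + ":" + charNum + " ***WARNING*** " + msg);
    }

    static boolean getErr() {
        return err;
    }
}
```

The main program would then check `getErr()` after name analysis and call the unparser only when it returns false.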

Writing Test Inputs

Note that your nameErrors.bach should cause error messages to be output, so to know whether your name analyzer behaves correctly, you will need to know what output to expect.

As usual, you will be graded in part on how thoroughly your input files test your code.

Some Advice

Here are a few words of advice about various issues that come up in the assignment:

Handing in

Turn in the following files to the appropriate assignment in Gradescope (note: these should be the only files changed/needed to run with the provided materials):

Please ensure that you do not turn in any sub-directories or put your Java files in any packages.

If you are working in a pair, make sure both partners are indicated when submitting to Gradescope.

Grading criteria

For more advice on Java programming style, see these style and commenting standards (which are essentially identical to the standards used in CS200 / CS300 / CS400).

Programming Assignment 4 (P4)

CS536-S25 Intro to PLs and Compilers

Due Tuesday, April 8 at 11:59 pm
