Introduction

AZ65 is a powerful but simple assembler for the Zilog Z80, MOS 6502, and Sharp LR35902 (sm83 / gbz80) architectures. In this book you'll learn the ins and outs of the assembler and its advanced meta-programming capabilities.

Note that this book only covers using AZ65 and does not cover assembly in general.

For assembly language references see:

z80.info for lots of Z80 resources.
6502.org for plenty of 6502 tutorials.
gbdev.io for resources on sm83/gbz80 programming.

Command Line Interface

All the functionality of AZ65 is in the az65 binary.

Assembling Files

Pass the target CPU architecture and the name of an assembly file to assemble it:

az65 6502 code.asm

Architectures

6502 for MOS 6502
z80 for Zilog Z80
sm83 for the Sharp LR35902 (a.k.a. gbz80)

By default, az65 will write the assembled program to stdout. You can direct this to a file using the > operator:

az65 6502 code.asm > code.bin

You can alternatively use the -o option to pick an output file:

az65 z80 code.asm -o code.bin

Search Paths

AZ65 lets you specify search paths for locating files that are referenced in your code. Pass each search path with its own -I option; the option can be repeated:

az65 sm83 code.asm -I macros -I data > code.bin

When a file is referenced by name, AZ65 first searches the directory of the file currently being assembled. If the file is not found there, each search path is checked in the order given.
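The lookup order described above can be modeled in a few lines of Python. This is only a sketch of the described behavior, not AZ65's implementation; the helper name `resolve_reference` and its parameters are hypothetical.

```python
from pathlib import Path


def resolve_reference(name, current_file, search_paths):
    """Return the first existing match for `name`, or None if nothing matches.

    Hypothetical helper: models the documented lookup order only.
    """
    # The directory of the file being assembled is tried first.
    directories = [Path(current_file).parent]
    # Then each -I path, in the order it was given on the command line.
    directories.extend(Path(p) for p in search_paths)
    for directory in directories:
        candidate = directory / name
        if candidate.is_file():
            return candidate
    return None
```

With `-I macros -I data`, a file that exists in both `macros` and `data` would resolve to the copy in `macros`, unless a copy also sits next to the file being assembled.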

Expressions

Number Formats

Binary

Binary numbers in AZ65 use standard 6502 binary number syntax. They are prefixed with a percent sign (%). For example:

%00001111
%1010
%0

Hexadecimal

Like binary numbers, hexadecimal numbers use the standard 6502 syntax. They are prefixed with a dollar sign ($) and are case-insensitive. For example:

$1234
$DADcafe
$0

Decimal

Numbers without a % or $ prefix are assumed to be decimal (base 10) numbers.
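As a rough illustration of the three formats, the following Python sketch picks the base from the prefix. The function `parse_number` is a hypothetical name used only for this example, and the sketch does not attempt to reproduce AZ65's error handling.

```python
def parse_number(text):
    """Convert an AZ65-style number literal to an integer (illustrative only)."""
    if text.startswith("%"):
        # Binary: digits after the percent sign are base 2.
        return int(text[1:], 2)
    if text.startswith("$"):
        # Hexadecimal: digits after the dollar sign are base 16, any case.
        return int(text[1:], 16)
    # No prefix: decimal.
    return int(text, 10)
```

For instance, `parse_number("%1010")` and `parse_number("10")` both yield the integer ten, and `parse_number("$DADcafe")` gives the same result as `parse_number("$dadcafe")`.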

Operators

All expressions in AZ65 operate on 32-bit signed integers with wrapping overflow and underflow semantics. All operators and their precedence match those of the C language, with a few notable modifications:

There is no C binary comma (,) operator. It is mostly an anachronism that many C programmers aren't even aware exists.
The unary < and > operators, common in 6502 assembly, are present. They return the low and high byte of a 16-bit word, respectively. For example:
- < $1234 evaluates to $34.
- > $1234 evaluates to $12.
There is a unary + operator. This is mainly used to disambiguate between expressions and memory locations in some assembly languages. For example, in Z80 assembly the instruction ld a, ($42) is ambiguous. A programmer may intend for this to load the value $42 into a, but AZ65 will interpret it as loading the byte at address $0042 into a. To make the intent clear, use a unary + to indicate that you are passing a numeric expression rather than an address:
- ld a, +($42)
Unsigned (logical) shift operators are provided. Use the <<< and >>> symbols to shift left and right respectively:
- $ffffffff >>> 1 evaluates to $7fffffff
- $ffffffff <<< 1 evaluates to $fffffffe
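The arithmetic behind these operators can be checked with a small Python model. It is a sketch of the semantics described above, not AZ65 code; the function names are hypothetical, and shift amounts are assumed to lie between 0 and 31.

```python
MASK32 = 0xFFFFFFFF


def wrap32(value):
    """Reduce any integer to a signed 32-bit value with wrapping."""
    value &= MASK32
    return value - 0x100000000 if value & 0x80000000 else value


def low_byte(word):
    """Model of unary <: bits 0-7 of the word."""
    return word & 0xFF


def high_byte(word):
    """Model of unary >: bits 8-15 of the word."""
    return (word >> 8) & 0xFF


def logical_shift_right(value, amount):
    """Model of >>>: zeros are shifted in from the left."""
    return wrap32((value & MASK32) >> amount)


def logical_shift_left(value, amount):
    """Model of <<<: bits shifted past bit 31 are discarded."""
    return wrap32((value & MASK32) << amount)
```

Note that `$ffffffff` is -1 when read as a signed 32-bit integer, so the result of `$ffffffff <<< 1`, `$fffffffe`, is -2 in the same representation. The model returns signed values; mask the result with `MASK32` to compare against the hexadecimal forms used in the list above.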

Strings

All strings in AZ65 are UTF-8 encoded. They are written enclosed in double quotes ("):

"Hello World"
"not a number: 1234"
"Howdy, cowboy 🤠"

Use C-style escape sequences to write special characters inside a string:

"line break: \n"
"tab: \t"
"double-quote: \""

Multi-line strings can be written by placing a backslash immediately before the line break:

"multi\
line\
string"

To encode a byte directly, place a backslash before a hexadecimal number:

"capital Q: \$51"

Multicharacter Literals

AZ65 also supports the multicharacter literal that is present in C.

You can specify big-endian 32-bit values as a sequence of 1 to 4 ASCII characters enclosed in single quotes ('):

'a'
'yo'
'test'
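To make "big-endian" concrete, here is a Python sketch that packs the characters of a literal into one integer, first character in the most significant position. It assumes that literals shorter than four characters occupy the low-order bytes, as most C compilers do; the source does not spell this out, and `multichar_value` is a hypothetical name.

```python
def multichar_value(text):
    """Pack 1 to 4 ASCII characters into an integer, big-endian (illustrative)."""
    if not 1 <= len(text) <= 4:
        raise ValueError("expected 1 to 4 characters")
    # Assumption: shorter literals are not padded on the right.
    return int.from_bytes(text.encode("ascii"), byteorder="big")
```

Under that assumption, 'a' corresponds to $61, 'yo' to $796f, and 'test' to $74657374.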

Labels

There are two types of labels in AZ65:

Global Labels

Global labels are labels as you'd normally expect them in an assembler. They are alphanumeric tokens that are used to name addresses and constants.

GlobalLabel:
    jr GlobalLabel

Note that the use of colons (:) is optional.

Local Labels

Local labels are labels defined within the "scope" of a global label, that is, labels that are defined after a global label in your code. They look like global labels but begin with a dot (.):

GlobalLabel:
    nop
.LocalLabel:
    jr .LocalLabel

Local labels are really just syntactic sugar for writing longer fully-qualified labels. The example above is equivalent to this:

GlobalLabel:
    nop
GlobalLabel.LocalLabel:
    jr GlobalLabel.LocalLabel

This means that two global labels can have local labels with the same name and they will not conflict with each other.

It also means you can always refer to a local label by its full name. When written this way, they are referred to as "direct" labels.
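The expansion from local to fully-qualified names can be sketched in Python as follows. This models only the rule described above; `qualify_labels` is a hypothetical helper, and rejecting a local label that appears before any global label is an assumption of the sketch, not documented AZ65 behavior.

```python
def qualify_labels(definitions):
    """Expand label definitions, given in source order, to their full names."""
    current_global = None
    qualified = []
    for name in definitions:
        if name.startswith("."):
            # Assumption: a local label needs an enclosing global label.
            if current_global is None:
                raise ValueError("local label %r has no global label" % name)
            # A local label is the enclosing global label plus its own name.
            qualified.append(current_global + name)
        else:
            # A global label opens a new scope for the local labels after it.
            current_global = name
            qualified.append(name)
    return qualified
```

Given the definitions `Start`, `.loop`, `Finish`, `.loop`, the sketch produces `Start`, `Start.loop`, `Finish`, `Finish.loop`, which shows why the two `.loop` labels do not collide.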

Simple Directives

In assembly languages, "directives" are special commands that are used to control the behavior of the assembler. AZ65 directives are special tokens that begin with an at-sign (@).

There are two kinds of directives in AZ65: "simple" and "macro-like" directives. We'll cover the simple directives first since they are just like directives found in most other assemblers.

`@echo`

We'll start with the @echo directive since it is very useful for debugging and demonstrating future directives.

The @echo directive takes a single string or expression argument and prints it to stderr.
