Description
The purpose of this lab assignment is to introduce the C programming language, including standard input-output
functions, command line arguments, File IO, and compilation with Makefiles.
Introduction to C
If you are not already familiar with C (or even if you are) it is recommended that you purchase a good C
reference such as C for Java Programmers: a Primer by Charlie McDowell (Lulu.com 2007). The C
programming language is in a certain sense the grandparent of Java (C++ being its parent). Java is known as an
Object Oriented Programming (OOP) language, which means that data structures and the procedures which
operate on them are grouped together into one language construct, namely the class. Common behavior
amongst classes is specified explicitly through the mechanism of inheritance. The C programming language on
the other hand does not directly support OOP, although it can be implemented with some effort. C is known as
a procedural programming language, which means that data structures and functions (i.e. procedures) are
separate language constructs. There are no classes, no objects, and no inheritance. New data types in C are
created using the typedef and struct constructs, which will be illustrated in future lab assignments. There is
however much common syntax between Java and C. Many control structures such as loops (while, do-while,
for), and branching (if, if-else, switch) are virtually identical in the two languages. One major difference is in
the way program input and output is handled, both to and from standard IO devices (keyboard and screen), and
to and from files. The following is an example of a “Hello World!” program in C.
Example
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
int main(){
printf(“Hello World!\n”);
return 0;
}
Comments in C are specified by bracketing them between the strings /* and */, and may span several lines.
For instance /* comment */ or
/* comment
comment */
or
/*
* comment
* comment
*/
are all acceptable. With the right compiler flags, Java/C++ style comments are also acceptable.
// comment
// comment
2
You may use any style you like, but throughout this document we will use the older C style /*comments*/.
Any line beginning with # is known as a preprocessor directive. The preprocessor performs the first phase of
compilation wherein these directives, which are literal text substitutions, are performed making the program
ready for later stages of compilation. The line #include
stdio.h that specifies functions for performing standard input-output operations. Notice that preprocessor
commands in C do not end in a semicolon. One can also specify constant macros using the #define
preprocessor directive as follows.
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
#define HELLO_STRING “Hello World!\n”
int main(){
printf(HELLO_STRING);
return 0;
}
Although you can call most C programs anything you want, each C program must contain exactly one function
called main(). Function main() is the point where program execution begins. Typically main() will call
other functions, which may in turn call other functions. Function definitions in C look very much like they do
in Java.
returnType functionName(dataType variableName, dataType variableName,. . .){
/* declarations of local variables */
/* executable statements */
/* return statement (provided return-type is not void) */
}
Recall that Java allows variable declarations to be interspersed among executable statements. Whether this is
allowable in C depends on which version of the language you are using. In particular the ANSI standard
requires that local variables be declared at the beginning of the function body and before any other statements.
The version of the C language we will use (C99) is not so strict on this point, but I recommend that students
adhere to the earlier standard (even in Java programs), since it can help organize one’s thinking about program
variables.
The function printf() prints formatted text to stdout. It’s first argument is known as a format string and
consists of two types of items. The first type is made up of characters that will be printed to the screen. The
second type contains format commands that define the way the remaining arguments are displayed. A format
command begins with a percent sign and is followed by the format code. There must be exactly the same
number of format commands as there are remaining arguments, and the format commands are matched with the
remaining arguments in order. For example
printf(“There are %d days in %s\n”, 30, “April”);
prints the text “There are 30 days in April”. Some common format commands are:
%c character
%d signed decimal integer
%f decimal floating point number
3
%s string (same as char array)
%e scientific notation
%% prints a percent sign
See a good reference on C for other format commands. (Note java contains a method System.out.printf
that behaves exactly like printf in C. There is a lengthy discussion of this function in the documentation for
java.io.PrintStream.)
Observe that the above main() function has return type int. A return value of 0 indicates to the caller (i.e.
the operating system) that execution was nominal and without errors. Actually the proper exit values for main
are system dependent. For the sake of portability one should use the exit codes EXIT_SUCCESS and
EXIT_FAILURE which are predefined constants found in the library header file stdlib.h. The exit code
allows for the status of the main program to be tested at the level of the operating system. (In Unix the exit
code will be stored in the shell variable $status.)
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
#include
#define HELLO_STRING “Hello World!\n”
int main(){
printf(HELLO_STRING);
return EXIT_SUCCESS;
}
Compiling a C program
A C program may be comprised of any number of source files. Each source file name must end with the
extension .c. There are four components to the compilation process:
Preprocessor
Compiler
Assembler
Linker
The preprocessor expands symbolic constants in macro definitions and inserts library header files. The
compiler creates assembly language code corresponding to the instructions in the source file. The assembler
translates the assembly code into native machine readable object code. One object file is created for each
source file. Each object file has the same name as the corresponding source file with the .c extension replaced
by .o. The linker searches specified libraries for functions that the program uses (such as printf() above)
and combines pre-compiled object code for those functions with the program’s object code. The finished
product is a complete executable binary file. Most Unix systems provide several C compilers. We will use the
gcc compiler exclusively in this course. To create the object file hello.o do
% gcc –c –std=c99 –Wall hello.c
(Remember % stands for the Unix prompt.) The –c option means compile only (i.e. do not link), -std=c99
means use the C99 language standard, and –Wall means print all warnings. To link hello.o to the standard
library functions in stdio.h and stdlib.h do
% gcc –o hello hello.o
4
The –o option specifies hello as the name of the executable binary file to be created. If this option is left out
the executable will be named a.out. To run the program type its name at the command prompt.
% hello
Hello World!
The whole four step process can be automated by doing
%gcc –std=c99 –Wall –o hello hello.c
which automatically deletes the object file hello.o after linking. If the program resides in just one source
file, this may be the simplest way to compile. If the program is spread throughout several source files (as will
often be the case in this course) a Makefile should be used to automate compilation. Here is a simple Makefile
for hello.c.
# Makefile for hello.c
hello : hello.o
gcc -o hello hello.o
hello.o : hello.c
gcc -c –std=c99 -Wall hello.c
clean :
rm -f hello hello.o
Here is an equivalent Makefile that uses macros.
# Makefile for hello.c with macros
FLAGS = -std=c99 -Wall
SOURCES = hello.c
OBJECTS = hello.o
EXEBIN = hello
all: $(EXEBIN)
$(EXEBIN) : $(OBJECTS)
gcc -o $(EXEBIN) $(OBJECTS)
$(OBJECTS) : $(SOURCES)
gcc -c $(FLAGS) $(SOURCES)
clean :
rm -f $(EXEBIN) $(OBJECTS))
See the examples page for a fancy hello world program in C that prints out operating system environment
variables.
Command Line Arguments
The main function in a C program may have two possible prototypes:
int main();
int main(int argc, char* argv[]);
5
The second prototype is used for C programs that use command line arguments. The first argument argc
specifies the number of string arguments on the command line (including the program name.) The second
argument is an array containing the string arguments themselves. (Note that in C a string is just a char array
and is specified by the type char*.) The parameter argc is necessary since arrays in C do not know their own
lengths, unlike in Java. The following program behaves similarly to its namesake in lab2.
/*
* CommandLineArguments.c
*/
#include
#include
int main(int argc, char* argv[]){
int i;
printf(“argc = %d\n”, argc);
for(i = 0; i
#include
int main(){
int x, y, z;
printf(“Enter three integers separated by commas, then press return: “);
scanf(” %d, %d, %d”, &x, &y, &z);
printf(“The integers entered were %d, %d, %d\n”, x, y, z);
return EXIT_SUCCESS;
}
A sample run of this program looks like
% a.out
Enter three integers separated by commas, then press return: 12, -7, 13
The integers entered were 12, -7, 13
Running again, but this time leaving out the separating commas in the input line gives
% a.out
Enter three integers separated by commas, then press return: 12 -7 13
The integers entered were 12, 4, -4261060
Since the comma separating the first and second integers was left out scanf() read the first integer, then
expected to read and discard a comma but failed to do so, then just returned without reading anything else. The
values printed for variables y and z are random and may be different when you run the program. Thus we see
that scanf() is intended for reading formatted input. This is what the “f” it its name stands for. The
scanf() function returns a number equal to the number of fields that were successfully assigned. That number
can be tested and used within the program, as the following example illustrates.
7
#include
#include
int main(){
int n, i; int x[3];
printf(“Enter three integers separated by spaces, then press return: “);
n = scanf(” %d %d %d”, &x[0], &x[1], &x[2]);
printf(“%d numbers were successfully read: “, n);
for(i=0; i
#include
int main(int argc, char* argv[]){
FILE* in; /* file handle for input */
FILE* out; /* file handle for output */
char word[256]; /* char array to store words from input file */
/* check command line for correct number of arguments */
8
if( argc != 3 ){
printf(“Usage: %s

