CMPS 12M Introduction to Data Structures Lab Lab Assignment 3 solution

$24.99

Original Work ?
Category: You will Instantly receive a download link for .ZIP solution file upon Payment

Description

5/5 - (4 votes)

The purpose of this lab assignment is to introduce the C programming language, including standard input-output
functions, command line arguments, File IO, and compilation with Makefiles.
Introduction to C
If you are not already familiar with C (or even if you are) it is recommended that you purchase a good C
reference such as C for Java Programmers: a Primer by Charlie McDowell (Lulu.com 2007). The C
programming language is in a certain sense the grandparent of Java (C++ being its parent). Java is known as an
Object Oriented Programming (OOP) language, which means that data structures and the procedures which
operate on them are grouped together into one language construct, namely the class. Common behavior
amongst classes is specified explicitly through the mechanism of inheritance. The C programming language on
the other hand does not directly support OOP, although it can be implemented with some effort. C is known as
a procedural programming language, which means that data structures and functions (i.e. procedures) are
separate language constructs. There are no classes, no objects, and no inheritance. New data types in C are
created using the typedef and struct constructs, which will be illustrated in future lab assignments. There is
however much common syntax between Java and C. Many control structures such as loops (while, do-while,
for), and branching (if, if-else, switch) are virtually identical in the two languages. One major difference is in
the way program input and output is handled, both to and from standard IO devices (keyboard and screen), and
to and from files. The following is an example of a “Hello World!” program in C.
Example
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
int main(){
printf(“Hello World!\n”);
return 0;
}
Comments in C are specified by bracketing them between the strings /* and */, and may span several lines.
For instance /* comment */ or
/* comment
comment */
or
/*
* comment
* comment
*/
are all acceptable. With the right compiler flags, Java/C++ style comments are also acceptable.
// comment
// comment
2
You may use any style you like, but throughout this document we will use the older C style /*comments*/.
Any line beginning with # is known as a preprocessor directive. The preprocessor performs the first phase of
compilation wherein these directives, which are literal text substitutions, are performed making the program
ready for later stages of compilation. The line #include inserts the standard library header file
stdio.h that specifies functions for performing standard input-output operations. Notice that preprocessor
commands in C do not end in a semicolon. One can also specify constant macros using the #define
preprocessor directive as follows.
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
#define HELLO_STRING “Hello World!\n”
int main(){
printf(HELLO_STRING);
return 0;
}
Although you can call most C programs anything you want, each C program must contain exactly one function
called main(). Function main() is the point where program execution begins. Typically main() will call
other functions, which may in turn call other functions. Function definitions in C look very much like they do
in Java.
returnType functionName(dataType variableName, dataType variableName,. . .){
/* declarations of local variables */
/* executable statements */
/* return statement (provided return-type is not void) */
}
Recall that Java allows variable declarations to be interspersed among executable statements. Whether this is
allowable in C depends on which version of the language you are using. In particular the ANSI standard
requires that local variables be declared at the beginning of the function body and before any other statements.
The version of the C language we will use (C99) is not so strict on this point, but I recommend that students
adhere to the earlier standard (even in Java programs), since it can help organize one’s thinking about program
variables.
The function printf() prints formatted text to stdout. It’s first argument is known as a format string and
consists of two types of items. The first type is made up of characters that will be printed to the screen. The
second type contains format commands that define the way the remaining arguments are displayed. A format
command begins with a percent sign and is followed by the format code. There must be exactly the same
number of format commands as there are remaining arguments, and the format commands are matched with the
remaining arguments in order. For example
printf(“There are %d days in %s\n”, 30, “April”);
prints the text “There are 30 days in April”. Some common format commands are:
%c character
%d signed decimal integer
%f decimal floating point number
3
%s string (same as char array)
%e scientific notation
%% prints a percent sign
See a good reference on C for other format commands. (Note java contains a method System.out.printf
that behaves exactly like printf in C. There is a lengthy discussion of this function in the documentation for
java.io.PrintStream.)
Observe that the above main() function has return type int. A return value of 0 indicates to the caller (i.e.
the operating system) that execution was nominal and without errors. Actually the proper exit values for main
are system dependent. For the sake of portability one should use the exit codes EXIT_SUCCESS and
EXIT_FAILURE which are predefined constants found in the library header file stdlib.h. The exit code
allows for the status of the main program to be tested at the level of the operating system. (In Unix the exit
code will be stored in the shell variable $status.)
/*
* hello.c
* Prints “Hello World!” to stdout
*/
#include
#include
#define HELLO_STRING “Hello World!\n”
int main(){
printf(HELLO_STRING);
return EXIT_SUCCESS;
}
Compiling a C program
A C program may be comprised of any number of source files. Each source file name must end with the
extension .c. There are four components to the compilation process:
Preprocessor

Compiler

Assembler

Linker
The preprocessor expands symbolic constants in macro definitions and inserts library header files. The
compiler creates assembly language code corresponding to the instructions in the source file. The assembler
translates the assembly code into native machine readable object code. One object file is created for each
source file. Each object file has the same name as the corresponding source file with the .c extension replaced
by .o. The linker searches specified libraries for functions that the program uses (such as printf() above)
and combines pre-compiled object code for those functions with the program’s object code. The finished
product is a complete executable binary file. Most Unix systems provide several C compilers. We will use the
gcc compiler exclusively in this course. To create the object file hello.o do
% gcc –c –std=c99 –Wall hello.c
(Remember % stands for the Unix prompt.) The –c option means compile only (i.e. do not link), -std=c99
means use the C99 language standard, and –Wall means print all warnings. To link hello.o to the standard
library functions in stdio.h and stdlib.h do
% gcc –o hello hello.o
4
The –o option specifies hello as the name of the executable binary file to be created. If this option is left out
the executable will be named a.out. To run the program type its name at the command prompt.
% hello
Hello World!
The whole four step process can be automated by doing
%gcc –std=c99 –Wall –o hello hello.c
which automatically deletes the object file hello.o after linking. If the program resides in just one source
file, this may be the simplest way to compile. If the program is spread throughout several source files (as will
often be the case in this course) a Makefile should be used to automate compilation. Here is a simple Makefile
for hello.c.
# Makefile for hello.c
hello : hello.o
gcc -o hello hello.o
hello.o : hello.c
gcc -c –std=c99 -Wall hello.c
clean :
rm -f hello hello.o
Here is an equivalent Makefile that uses macros.
# Makefile for hello.c with macros
FLAGS = -std=c99 -Wall
SOURCES = hello.c
OBJECTS = hello.o
EXEBIN = hello
all: $(EXEBIN)
$(EXEBIN) : $(OBJECTS)
gcc -o $(EXEBIN) $(OBJECTS)
$(OBJECTS) : $(SOURCES)
gcc -c $(FLAGS) $(SOURCES)
clean :
rm -f $(EXEBIN) $(OBJECTS))
See the examples page for a fancy hello world program in C that prints out operating system environment
variables.
Command Line Arguments
The main function in a C program may have two possible prototypes:
int main();
int main(int argc, char* argv[]);
5
The second prototype is used for C programs that use command line arguments. The first argument argc
specifies the number of string arguments on the command line (including the program name.) The second
argument is an array containing the string arguments themselves. (Note that in C a string is just a char array
and is specified by the type char*.) The parameter argc is necessary since arrays in C do not know their own
lengths, unlike in Java. The following program behaves similarly to its namesake in lab2.
/*
* CommandLineArguments.c
*/
#include
#include
int main(int argc, char* argv[]){
int i;
printf(“argc = %d\n”, argc);
for(i = 0; i
#include
int main(){
int x, y, z;
printf(“Enter three integers separated by commas, then press return: “);
scanf(” %d, %d, %d”, &x, &y, &z);
printf(“The integers entered were %d, %d, %d\n”, x, y, z);
return EXIT_SUCCESS;
}
A sample run of this program looks like
% a.out
Enter three integers separated by commas, then press return: 12, -7, 13
The integers entered were 12, -7, 13
Running again, but this time leaving out the separating commas in the input line gives
% a.out
Enter three integers separated by commas, then press return: 12 -7 13
The integers entered were 12, 4, -4261060
Since the comma separating the first and second integers was left out scanf() read the first integer, then
expected to read and discard a comma but failed to do so, then just returned without reading anything else. The
values printed for variables y and z are random and may be different when you run the program. Thus we see
that scanf() is intended for reading formatted input. This is what the “f” it its name stands for. The
scanf() function returns a number equal to the number of fields that were successfully assigned. That number
can be tested and used within the program, as the following example illustrates.
7
#include
#include
int main(){
int n, i; int x[3];
printf(“Enter three integers separated by spaces, then press return: “);
n = scanf(” %d %d %d”, &x[0], &x[1], &x[2]);
printf(“%d numbers were successfully read: “, n);
for(i=0; i
#include
int main(int argc, char* argv[]){
FILE* in; /* file handle for input */
FILE* out; /* file handle for output */
char word[256]; /* char array to store words from input file */
/* check command line for correct number of arguments */
8
if( argc != 3 ){
printf(“Usage: %s \n”, argv[0]);
exit(EXIT_FAILURE);
}
/* open input file for reading */
in = fopen(argv[1], “r”);
if( in==NULL ){
printf(“Unable to read from file %s\n”, argv[1]);
exit(EXIT_FAILURE);
}
/* open output file for writing */
out = fopen(argv[2], “w”);
if( out==NULL ){
printf(“Unable to write to file %s\n”, argv[2]);
exit(EXIT_FAILURE);
}
/* read words from input file, print on separate lines to output file*/
while( fscanf(in, ” %s”, word) != EOF ){
fprintf(out, “%s\n”, word);
}
/* close input and output files */
fclose(in);
fclose(out);
return(EXIT_SUCCESS);
}
Notice that by inserting the leading space in the format string for fscanf(), we skip all intervening whitespace
characters and thus parse the tokens in the file. (Recall a token is a substring that is maximal with respect to the
property of containing no whitespace characters.)
What to turn in
Write a C program called FileReverse.c that behaves exactly like the program FileReverse.java that
you wrote in lab 2. Thus FileReverse.c will take two command line arguments naming the input and output
files respectively (following the FileIO.c example above.) Your program will read each word in the input
file, then print it backwards on a line by itself. For example given a file called in containing the lines:
abc def ghij
klm nopq rstuv
w xyz
the command % FileReverse in out will create a file called out containing the lines:
cba
fed
jihg
mlk
qpon
vutsr
w
zyx
Your program will contain a function called stringReverse() with the heading
9
void stringReverse(char* s)
which reverses its string argument. Place the definition of this function after all preprocessor directives but
before the function main(). Your main function will be almost identical to FileIO.c above, except that the
while loop will contain a call to stringReverse(word). Although it is possible to write stringReverse()
as a recursive function, it is not recommended that you do so unless you are very familiar with C strings and the
string handling functions in the standard library string.h. Instead it is recommended that you implement a
simple iterative algorithm to reverse the string s (see below).
There is one function from string.h that you will need however, and that is strlen(), which returns the
length of its string argument. For example the following program prints out the length of a string entered on the
command line.
/*
* charCount.c
* prints the number of characters in a string on the command line
*/
#include
#include
#include
int main(int argc, char* argv[]){
if(argc<2){ printf("Usage: %s some-string\n", argv[0]); exit(EXIT_FAILURE); } printf("%s contains %d characters\n", argv[1], strlen(argv[1]) ); return EXIT_SUCCESS; } Remember that a string in C is a char array, not a separate data type as it is in Java. Recall also that arrays in C cannot be queried as to their length. How then does strlen() work? Actually a string in C is a little bit more than a char array. By convention C strings are terminated by the null character '\0'. This character acts as a sentinel, telling the functions in string.h where the end of the string is. Function strlen() returns the number of characters in its char* (char array) argument up to (but not including) the null character. Here is an algorithm in high level pseudo-code which will perform the string reversal: 1. Set two variables i and j to i=0 and j=strlen(s)-1 2. Swap the characters s[i] and s[j] 3. Increment i and decrement j 4. Go back to 2 and stop when i>=j
One more question should be answered here. Are arrays in C passed by value or by reference? Obviously by
reference, for otherwise functions like stringReverse() would have no effect on their array arguments
outside the function call. In fact an array name in C is literally the address of (i.e. a pointer to) the first element
in the array. Arrays, strings, and pointer variables in C will be discussed in subsequent lab assignments.
Write a Makefile that creates an executable binary file called FileReverse, and includes a clean utility.
Submit the files: README, Makefile, and FileReverse.c to the assignment name lab3. Start early and ask
for help if you are at all confused.