joe0400/TeachingCBasics

Preface

This article teaching the basics of computing is written by a senior in college. This senior, wanted to make it easier in any way possible for new people to get into computer science, and learn how to develop. This will expand as more and more info is added, and is not to be used as the only source of knowledge needed.

For CS Students,

Computer Science is an extremely complicated but rewarding field. Some of the concepts when talking about computer hardware or computing philosophy, can be quite difficult for those with no background. Persevere through it, it is worth it.
In this teaching series we use the language C and C++ for most of it. I personally believe that due to the exposure to hardware that these languages allow, it makes the learner have a better understanding of how the hardware works.\

PART 0 DATA TYPES

In all programming languages they have data types. These data types are able to be used to initialize variables depending on what type of information they want to store. Here we will list the primitive types.

-------------------------(C/C++)-------------------------
short:     short integer   (usually 2 bytes)
int:       integer         (usually 4 bytes)
long:      long integer    (usually 8 bytes, sometimes 4)
long long: long integer    (usually 8 bytes)
float:     single precision floating point 
double:    double precision floating point
char:      character       (byte)
--------------------------(C++)--------------------------
bool:      boolean

The list of these types seem quite arbirtrary, After all why are four integer types? Why are there two floating point types? What is a floating point? What is a Integer? What is a boolean?

PART 0.1 Integers

Integers are values that can be represented as whole numbers in math. IE {-1,0,1}. Seems simple enough. However in all computers, because of the limited size of the memory available, and how hardware is designed, we put sizes on these integer types. These sizes determine how big of a range each integer type has. That is why there are four integer types. You can actually calculate the range of the integer type quite easily. \

lower_bound = -2^(bits-1), upper_bound = 2^(bits-1)-1

The reason why this equation works is due to the fact how computer hardware stores these integers and why we do it the way we do.\

Binary Numbers

So integers are stored in hardware as binary data. Binary data is represented as on (1) or off (0). The individual data slices (singular ones or zeros) are called bits, and a group of eight are called a byte. How integer data is held in memory are in groups of bytes, composed of bits, which stores the binary form of that decimal number. Now to understand how to convert binary numbers to normal arabic numbers, there is a easy to understand method.
Lets say I have the binary number 0b 0101 0110 (0b is just a prefix to identify the following digits are in binary) that I want to convert to decimal. What I do is I multiply the bit by 2^(its position). Then I sum all the numbers. For example

0b | 0   1   0   1   -   0   1   1   0
 *   2^7 2^6 2^5 2^4     2^3 2^2 2^1 2^0
 = | 0 + 64+ 0+ 16+      0+  4+  2+  0
 = | 64 + 16 + 4 + 2 =   86

How do we convert decimal to binary though?
We divide the number from the largest 2^n that resolves in 1 and we continue down from 2^(n-1) to 2^0 on the remainder of each. For example

decimal 100
100 / 2^6 = 1, r36         :   0b| 1...
  36 / 2^5 = 1, r4         :   0b| 1 1...
    4 / 2^4 = 0, r4        :   0b| 1 1 0...
      4 / 2^3 = 0, r4      :   0b| 1 1 0 0...
        4 / 2^2 = 1, r0    :   0b| 1 1 0 0 1...
          0 / 2^1 = 0, r0  :   0b| 1 1 0 0 1 0...
            0 / 2^0 = 0, r0:   0b| 1 1 0 0 1 0 0
thus 0b| 0 1 1 0 - 0 1 0 0 = (100)

To understand why this works, we have to understand how all numbering systems work.
Decimal, or Base 10, has ten individual symbols to represent numeric values. These are our digits 0-9. We have the places that each number sits in to represent further numbers past nine. These are our places. Now each place is going to be 10^(position of the number)'s place. This is becauase after 9, we cant fit a 10 number in the first place, so it 'rolls' over to the next place, and we get the 10s place, then the 100s... All other base N systems follow a simmilar system. Binary, or Base 2 is very similar. We have 2 digits, 0 and 1. To fit 2, we 'roll' to the next spot to get it. Thats why it works, cause each place is going to be a version of 2^n.\ There are other numbering systems too. Base 16, or hexadecimal works in a similar way. It has digits 0,1,2,3...9,A,B,C,D,E,F, and has 16s places.
What about adding? Again works very similar to decimal, when the number exceeds the base, it rolls the digit over to the next spot to add in. To show what I mean, look at this text box where I add two binary numbers.

carry| 1 1
   0b| 0 1 1 0 - 0 0 1 0
+  0b| 0 1 1 0 - 0 0 0 0
--------------------------
   0b| 1 1 0 0 - 0 0 1 0

Seeing how the two 1s add up, it is quite similar to decimal addition. What about subtracting? Good question.

Two's Compliment

Because of the fixed size of these integers, there is a occurance that happens when you add two high binary numbers, its something called a integer overflow. To show what I mean, I will add two large numbers

    0b| 1 0 0 0 - 0 0 0 0 (256)
  + 0b| 1 0 0 0 - 0 0 0 0 (256)
  -----------------------
    0b| 0 0 0 0 - 0 0 0 0 (0)

These two numbers added overflow the range of the integer and flip back. This is what happens in an integer overflow.
How we harness this overflow for negative integers, is called Two's Compliment. The process is as such. We flip all bits on the binary number, and add one to it. This is how we get the negative of a binary number.

 invert 0b| 0 0 1 0 - 0 1 1 1 (49)
 =      0b| 1 1 0 1 - 1 0 0 0
 +      0b| 0 0 0 0 - 0 0 0 1
        ---------------------
        0b| 1 1 0 1 - 1 0 0 1 (-49)

To prove that its the negative, we will take the original and add it to its negative. This should result in zero.

 carry | 1 1 1 1   1 1 1
     0b| 0 0 1 0 - 0 1 1 1 (49)
   + 0b| 1 1 0 1 - 1 0 0 1 (-49)
   -----------------------
     0b| 0 0 0 0 - 0 0 0 0 (0)

The addition of these two numbers resulted in zero, meaning that the it is the negative of the original number. What about the method we use to find the decimal value of a binary number? Wont this mess it up? Yes, however one of the properties of a negative two's compliment binary number is that the left most digit is going to be one if its negative, zero if its positive. If its negative, you could just get the two's compliment of that number, then find the decimal value from that.

PART 0.2 Floating Points

This section is going to be more of a brief overview of floating points, as it is too complicated to show how it works under the hood.
Floating points are decimal numbers that support numbers with decimal points. This means you could store numbers such as 3.14 and others. Precision is how far in the decimal point you can accurately store. Single precision floating points can hold all short numbers safely, and all double precision floating points can hold all int numbers safely.

PART 0.3 Characters

Characters are individual letters and other symbols. char stores such characters. Under the hood the language we are using uses ASCII or (American Standard Code For Information Interchange). This is a set of numbers that represent individual characters, not just including letters. This is how text is displayed.

PART 0.4 Boolean

Booleans are very simple. They are true or false values.

PART 0.5 Variables

Variables are named positions in memory which have a specific type. To instantiate a variable we do such:

type_goes_here name_goes_here;

For example if we wanted a int with the name computers we could do this:

int computers;

All instatiated variables are filled with garbage values (random values) unless assigned a value. To assign a value we use the equals operator. To assign a value to an already instantiated variable we could do this:

variable_name = value_goes_here;

You can also combine an instantiation and an assignment. How this is done is as such

type_goes_here name_goes_here = value_goes_here;

For example if we wanted a int variable computers with the value equal to 32 we do this:

int computers = 32;

For characters the value assigned needs to be in single quotes. For example if I wanted a variable first_initial with type char, and assigned to J I would need to do

char first_initial = 'J';

PART 0.6 Casting

Casting can be both implicit and explicit. Casting is when you change one data type to another data type. Casting can be implicit when its safe (looses no precision / values). Casting to unsafe types requires to use a cast operator. Explicit casting is done as such:

int number = 32;
short result_cast = (short)number;

This casts number to a short and assigns it to result_cast. Below I put a tree with the order of safe casting. You can cast from any position to a position below it.

bool -> char -> short -\---> int ---\---> long ---> long long
                        \            \
                         \-> float ---\---> double

Safe casts can be casted implicitly. This means that if you assign a value from a short to a int it will automatically cast safely.

PART 0 Questions

What is the decimal of 0b 0010 1110?
What is the negative of 30 (byte size integer)?
What is 40 + 0b 0001 1010?
What is 40 - 0b 0001 1010?
Are these variable assignments correct? Which ones are incorrect and why?

int total = 42;
char value = 'd';
float number = 32.0;
short thing = 'g';
char alpha = 32.2;
int zdepth = 33.1;
float intial = 'D';

PART 1 OPERATORS

Operators are a essential component of programming languages. They provide us with the ability to manipulate the data itself. There are many operators in C. We are going to start with comparison operators, as they are simple.

PART 1.1 Comparison operators.

There are a few comparison operators in C. Comparision operators are operators that compare two values and return a value that is true or false (one or zero). The operators are as such

a == b: a equal to b
a != b: a not equal to b
a <  b: a less than b
a >  b: a greater than b
a <= b: a less than or equal to b
a >= b: a greater than or equal to b

These operators return true (one) if the statement is correct, and false (zero) if the statement is incorrect

PART 1.2 Boolean Operators

Boolean operators are operators that take one or two boolean values (true or false) and return a boolean value. The operators are as such:

!a:     not a
a || b: a or b
a && b: a and b

The not operator returns the opposite of the boolean value. If true it returns false, and false returns true. The or operator returns false only if a and b are both false, otherwise true. The and operator return true only if a and b are both true, false otherwise.

PART 1.3 Other Operators

Many of these mathmatical operators you probably already know of, however C has a few bitwise operators. The example of all C's other operators are as such:

a + b:  a plus b
a - b:  a minus b
-a:     negative of a
a / b:  a divided by b
a * b:  a multiplied by b
a++:    increment a
a--:    decrement a
~a:     bitwise invert of a
a | b:  bitwise or of a and b
a & b:  bitwise and of a and b
a >> b: a bitwise left shift by b bits
a << b: a bitwise right shift by b bits
a = b: a assigned b

The bitwise operators apply to the individual bits. So for bitwise invert would flip every bit on that variable to the opposite value for example. For bitwise or it every bit that is one on a or b that is active. Bitwise and is only every bit active on both a and b. The shift operators rotate the bits by n bits in either direction, dropping the bits off the end.
To understand better, I will provide examples here:

0b|1010 0101 bitwise invert = 
0b|0101 1010
0b|1010 1010 bitwise or with
0b|1100 0011 = 
0b|1110 1011
0b|1010 1010 bitwise and with
0b|1100 0011 = 
0b|1000 0010
0b|1010 1010 left shift by 2 =
0b|1010 1000
0b|1010 1010 right shift by 2 =
0b|0010 1010

PART 1.4 Order of Operations

Like with math, operations on progams have an order. This applys to all operators. The order of operation is as such from top down:

a * b, a / b
a + b, a - b, -a
a++, a--,
a & b, a | b, a~
a >> b, a << b
a == b, a != b, a < b, a > b, a <= b, a >= b
a && b, a || b, !a
a = b

The program will compute the output following the order of operations.

PART 1 Questions

For all following questions assume as follows:

a = 10
b = 3
c = 49
d = 2
e = 1

Answer the questions with the result of the operations:

a - 7 == b
c - a * 4 <= b
a * a == (b + d) * d * a

PART 2 BASIC PROGRAM

To make a compilable C program, it requires not much. Below is the begining of a compilable C program.

#include <stdio.h>
// comments
int main(int argc, char** argv){
	return 0;
}

Much of this you havent seen before, I will explain. #include <stdio.h> is what is called a preprocessor include. A preprocessor include imports a library, or a external codebase into your program. The external codebase contains a header file, stdio.h in our example. This header file contains all the definitions of functions that library contains. stdio.his called the Standard Input Output Library, or STDIO for short. This library allows us to request input or output values onto the terminal window through various functions.
// indicates the beggining of a single line comment. Comments are to provide clarity for the reader of the code, however they are not compiled.
int main(int argc, char** argv){ is a function definition called the program entrypoint. The program entrypoint is where the program begins executing code at. In C the program entrypoint is at a function called main. All functions in C have a return type, which is the type of value that the function hands back when completed. In main's case, it is type int. For main the values that the function hands back is called a exit code. Exit codes tell the operating system about how the program exited, if it exited with an error or not, and what type of error.
int argc, char** argv are called the function parameters. For main's case its the terminal arguments. argc is the count of the arguments provided, and argv is the text of the argument values.
return 0; returns exit code of the program and finishes the program.
What is defined between main's { and } is called the body of the function and is executed when ran.
Dont worry, this is a lot to take in, and most of it you do not need to remember. Just remember that the begining of the program happens inside main, and the text for the beginning of a program

PART 3 WRITING AND COMPILING

From this point on, it is required to use Ubuntu Linux with GCC. If you do not have a machine that runs Ubuntu Linux head to Adendum and follow the directions.
Begin by writing your first C program by using this template in a program called text editor.

#include <stdio.h>

int main(int argc, char** argv[]){
	printf("Hello World!");
	return 0;
}

and save it in your home directory named as hello.c. The .c indicates the type of file that you have just wrote. This file is called a C Source File and contains the code of your program. Now open your terminal. You should see something like this:

user@pc$~:

The first term indicates the user of the terminal, and the second terminal indicates what computer they are on. The $ indicates what level of privileges the terminal has. In this case it has the privileges of the group your user is in.
Now to compile your program. Compiling is when you convert human readable code into machine readable code. The Compiler is the program we are going to use to compile our source file into a program. The Compiler we will be using is called GCC or GNU C Compiler. To compiler our program we will run this command gcc hello.c -o hello. the first argument is the source file of our program. -o indicates the outfile, and immediately following it is the program executable name.
To then run our program which is called hello, we will execute it by typing ./hello. This executes the program.
You wrote, compiled, and executed your first program! Congratulations! From this point on we will be using this method to write programs.

PART 4 FUNCTIONS

Functions are going to be the backbone of most of your program. Functions allow you to define routines into seperate executable chunks. The point of a function is to make the program more readable and easier to debug.
Functions contain at least a function definition and may contain a function declaration. Function definitions contain the parameters, return type, funciton name and the body of the function. Parameters are what is handed to the function to execute for a desired result. These are defined as typed variables. The return type is what the function returns after finishing executing back to the location it was called. The function name is what the function is called by. The body of the function contains all the code for the function. Functions are called and execute from where it was called at.
Function declarations provide the name, the type of the parameters, and the return type
A example function declaration and definition are provided below:

int square(int); // declaration
int square(int x){ // definition
    return x*x;
}

C can return any type and one special return type. void returns nothing to the point it was called.

PART 4 Questions

Below you will find a imcomplete program with function declaration and a main driver. Complete the code for the desired effect.

#include <stdio.h>
#define PI 3.14

double find_area_of_circle(int);
double find_area_of_circle(int radius){
	// Fill this stub to complete the program
	// This stub has radius as a parameter, and
	// should return the area of the circle
	// for pi, use PI
}

int main(int argc, char** argv){
	for(int i = 1; i < 101; i++){
		printf("the circle with radius %d has an area of %lf\n",i, find_area_of_circle(i));
	}
	return 0;
}

PART 5 Printing

stdio will be a vital part of our programs, and we will need to learn first how to write to the terminal using the functions in this library and how they work.
Most importantly we need to learn what a string is. A string is a array of characters. A null terminated string is a string that ends with the value zero, also known as a null character. stdio uses null terminated strings to provide for IO. These strings defines the specification of how to print or read from the terminal. The strings that define that specification are called format strings and are defined by character combinations for replacements.
The specification for printing to the screen is provided below.
printf(format_string, arguments); prints the string with the arguments to replace values with to the terminal.
Format strings are strings where character combos are replaced on screen. The specification for the combos are provided below

%d: decimal
%f: float
%ld: long decimal
%lf: long float

Strings also can contain escape sequences. Escape sequences are character combos that trigger the terminal to act differently. Right now all you need to know for escape sequences is the line feed. The Line feed escape sequence is \n.\

For example if we wanted a program to write a few integers and a double to the terminal, we would use printf. Lets say the variables int total, int age, int acct_id, double total_distance_traveledare provided. We would write a printf call like this:

printf("account: %d for %d months has accumulated %lf miles for %d$",acct_id, age, total_distance_traveled, total);

PART 6 CONTROL AND FLOW

Programs must be able to act differently based on state. State is the current condition of the program. Without the ability we would be just defining math statements. We have multiple ways to control the direction a program is going in. The first one is called an if/if else/else statement.\

PART 6.1 if/if else/else statements

An if statement is a conditional control statement that executes based off the condition provided. . An if statement executes what is inside { and } whe the condition provided is true. An example if statement is provided here:

if(age < 21){
    printf("YOU ARE UNDERAGE!\n");
}

In this if statement, it will print YOU ARE UNDERAGE! if age is less than 21. However if your age is equal to or over 21 nothing happens.
else if statements are a control statement that occurs only if the proceeding if statement fails and it condition is true. We could extend this so that if age is over 49 we could print that its happy hour.

if(age < 21){
    printf("YOU ARE UNDERAGE!\n");
}
else if(age >= 50){
	printf("happy hour! :)\n");
}

But what about if your of age, but under 50? We could solve this by using an else statement. Else statements are executed if and only if all previous else if's and if statements are not executed. To implement a check for between 21 and 50, we will add an else statement to the existing code.

if(age < 21){
    printf("YOU ARE UNDERAGE!\n");
}
else if(age >= 50){
	printf("happy hour! :)\n");
}
else{
	printf("time for a drink...\n");
}

In this case it will print time for a drink if we are between 21 and 50. If less than 21, it will print that we are underage. Finally if we are above 50 it will let us know that we are at happy hour.

PART 5.2 Switch statements

Switch statements allow us to define routines for each case. These cases are matched to the parameter of the switch statement. A example would be such:

// lets say that checking accounts are 0, 
// savings are 1, and money markets are 2
double apr;
switch(acct_type){
    case 1:
	    apr = 0.0;
	    break;
	case 2:
		apr = 0.0001;
		break;
	case 3:
		apr = 0.015;
		break;
}

It allows you to define a set of routines for each case and make the code more readable. Switch statements are strong at differentiating based on a large group of conditions compared to a if else statement.

PART5.3 Looping

Looping is vital to our programs. It allows us to deal with more than one value at a time. There are three types of loops. while loops, for loops, and do while loops.
While loops are loops that execute only when the condition is present. How while loops work is on entrance to the loop it checks the condition. It then procedes to execute its whole body if its satisfied. It will continue to do this untill its condition fails. This example will double its value until its over 500.

int value = 1;
while(value <= 500){
    value = value * 2;
}

For loops are similar but provides a spot for initialization, a conditional statement, and a increment statement. This makes it really good for incrementing a number until it reaches a value. For example this example will count untill 100;

for(int i = 0; i < 100; i++){
    printf("%d\n",i);
}

The first statement inside the parenthes is the initializaition statement. We define an int inside of it, int i. (loops often use the letters i,j,k for looping. This is called ijk notation) The next statement is the conditional statement. Finally the incremental statement increments i by 1 before checking the condition again.
Do while loops are very similar to while loops. The major difference is do while loops will execute once and then check the conditional before executing again. The do while loop looks like such:

do{
    body;
}while(conditional);

everything in the body is executed at least once, until the conditional is satisfied.

PART 5 Questions

Write a program that will compute the sum from 0->100 of all sums from 0->n of k where k is the inner sum, and n is the outer sum variable. Display this sum on screen with a printf statement.

PART 6 ARRAYS AND POINTERS

Arrays are a group of values all under one name. Arrays will share like type, but each element will vary in value. An element of an array is a singular value from that array. Arrays are allocated at compile time for its size. Array size is equal to the length of the array multiplied by the length of the array.
Pointers are named locations in memory (variable) that stores the location of another variable. To understand this better thing of it this way. Pointers are like the street adress of the friends house. You can go to your friends house to meet your friend there, but you do not own your friend directly.
Pointers contain a reference address which is an address in memory. An address in memory is the location where the value is stored. Pointers allow you to look up for that value and modify that value without directly owning it.
You may ask why is this necessary? Well how would we modify elements in array without owning them without it. Pointers allow us to just go to the next spot and access it. Arrays are defined in the first elements address. To better visualize this, underneath is provided a image of a facetious memory map.

        [          array         ]         [ x ptr]
address |  0 |  1 |  2 |  3 |  4 |  6 |  7 |    8 |
value   |  9 | 10 | 22 |  3 | -5 | 92 | 32 |    0 |

If we go to x ptr and look at its value, its 0, which is the begining address of the array. Lets say we wanted to get the third element of the array, we would add 2 to the base address of the array, and we would get the value of the third element. That is the power of pointers.
But how do I even use a pointer or declare one?
The declaration of a pointer is a type with a star following it. For example a int pointer would be declare like int* a. To assign a pointer to a location of a variable we can do that by using the reference operator. The reference of operator is a ampersand followed by the variable name. lets say we wanted to assign the address of b to int* a. we can do that by doing this: int* a = &b; How we can retrieve or assign a value to the location that int* a stores is by using the dereference operator. The dereference operator is * and precedes the pointer name. Lets say we wanted to assign 3 to the location int* a stores. We could do that by doing this: *a = 3; It is important to note that memory addresses could be invalid, and a special value called NULL points to a invalid memory address to indicate an error or no value. To check if a pointer is pointing to NULL we can do a simple conditioanl equals check against NULL.
What about arrays?
Arrays are special and have to be intialized by creating a new array. This is done type array[num] where type is the data type, array is the array name and num is the total number of elements. This will allocate enough memory for the array to use. To grab elements from an index we can either add to the array value and dereference, or better, use the indexing operator. The indexing operator is array[num] where num is the location of the element in the array. It is important to note in C arrays are zero indexed. This means the arrays first element starts at 0 and ends at n - 1 elements.

PART 6 Questions

Finish the program's function to calculate all the squares of an array.

#include <stdio.h>
// array is the array, and length is the length of the array
void calculate_array_squares(int* array, int length){
	
}
int main(int argc, char** argv){
	int array[100];
	for(int i = 0; i < 100; i++){
		array[i] = 100 + 2*i;
	}
	calculate_array_squares(array,100);
	for(int i = 0; i < 100; i++){
		printf("%d is the result of the function\n",array[i]);
	}
	return 0;
}

PART 7 RECURSION

Recursion is defined where a problem can be called on a subcase of itself. Recursion is useful for solving problems where the problem cant be identified as easily on a large scale comparatively to a problem in terms of itself. All recursive functions will need a base case or you run the risk of having a stack overflow. A stack overflow is when all the saved values from calls to the functions overflows the total amount the stack can save. Recursion is done often with calls to a function from itself. A famous example is fibonacci sequence.

void fibonacci(int a, int b, int under){
    if(a >= under){
	    return;
	}
	printf("%d, ",a);
	fibonacci(b, a+b, under);
}

This will print all elements of a fibonacci sequence by calling itself with the next elements of fibonacci.

PART 7 Question

Write a program that will solve the Collatz conjecture for an arbitrary N

PART 8 STRUCT

Structs are defined data collection that contains fields of any type that you can access using the member operator. Structs have to be defined before being delared, and used. To define a struct, we can do this:

struct Person{
    char* first_name;
    char* last_name;
    int age;
    double height;
 };

All variables declared to this struct type must use the type struct Person. To declare a variable of that struct type we can do such struct Person me; To assign values to structs we must used the member operator. To change the age of variable me we can do such me.age = 3;. If the variable is a pointer to a struct we must use arrow notation to access a member. Assuming struct Person* customer contains a valid memory address, we could change the height value by doing customer->height = 1.3;.

PART 8 Questions

Declare and define a struct that contains fields for info needed for a bank account.

PART 9 MEMORY ALLOCATION

Allocation is when the operating system reserves memory for use within the program. Memory allocation is necessary for a program that can vary based off of data and lengths of said data. Allocated memory needs to be free'd when finished being used. Allocated memory that is not free'd results in a memory leak. A memory leak causes system memory allocated to not be free'd and returned to the os for other programs to use. This leads to poor performance and possible crashing.
To allocate and free memory it is import to use a preprocessor include to include stdlib.h also known as the Standard Library. Inside stdlib.hmultiple functions are defined. malloc(size_t) returns a pointer if it is able to allocate a memory region of size size_t. If it fails to allocate memory it returns NULL. Trying to access memory at NULL will lead to a crash. To get the size of a data type the sizeof(type)will return the size_t of the datatype. This includes user defined types such as struct's. You can multiply sizeof(type)by n to to get the size of a array of n elements of type. To free allocated memory, the free(pointer)function will free the memory region associated with the pointer. Free does not modify the value of the pointer however, so be wary of dangling pointers. Dangling pointers are pointers that point to an inaccessable region of memory. Best practice when freeing is to pass a destructor a pointer to the pointer. This way you can free the dereference of the pointer, and then assign null to the dereference of the pointer. For example:

typedef void* object;

object destroy(object* obj){
	free(*obj);
	return *obj = NULL;
}

This removes the chances of trying to access a dangling pointer.
Tools such as Valgrind are available to debug for memory leaks and dangling pointers. To run Valgrind, you must install it on your Ubuntu virtual machine. To install valgrind you must run the command sudo apt-get install valgrind , enter the root pasword you had setup, and accept by typing y. To use Valgrind, you enter valgrind ./executable_name into your terminal. It will let you know if you had any malloc's and if you missed any free's.

PART 10 OBJECT ORIENTED MODEL

Part 9 described a method for dynamically allocating memory for data/data types. Part 8 described structs, a user defined data type that has member fields. These two methods allow us to define objects which allow us to abstract away implementation from the program / users. This is useful because it makes the code more human readable and easier to understand, whilst allowing us to hide the implementation. This allows us to make it into a black box. Using a typedef object_name void*; allows us to expose zero implementation of the fields. This uses the fact that any pointer can be cast to void* and back. The Object Oriented Model describes objects that have fields which contain data and state for that object, and methods which act upon instances of the object. All objects have a constructor, destructor, and methods. Constructors allocate the memory needed for the object to be instantiated. Destructors safely delete the memory of the fields than the memory of the object.
Lets say we wanted to make a Color object. This Color object should be able to contain individual red, green, and blue values. It should also be able to access its red, green, and blue values. These methods are called Accessors or Getters. It should also be able to modify these values. These methods are called Mutators or Setters. Then we should be able to calculate the HSV, and return it in a external object called HSV_Color using its __init__HSV_Color(double hue, double saturation, double value);
First lets start with the fields, which would be under a structure, then we will declare the methods, constructor, and destructor.

typedef struct{
    int r;
    int g;
    int b;
} color;
typedef void* Color;

Color __init__Color(int r, int g, int b);
int Color_get_r(Color self);
int Color_get_g(Color self);
int Color_get_b(Color self);
void Color_set_r(Color self, int r);
void Color_set_g(Color self, int g);
void Color_set_b(Color self, int b);
HSV_Color Color_get_HSV_Color(Color self);
Color __del__Color(Color* self);

We have just defined the API or Application Programming Interface for this library. Now lets define the implementation of the constructor.

Color __init__Color(int r, int g, int b){
    color* ret = (color*)malloc(sizeof(color));
    if(ret == NULL)
	    return NULL;
	ret->r = r;
	ret->g = g;
	ret->b = b;
	return (Color)ret;
}

The constructors whole point is to return a object that has been allocated with all the values correctly assigned.
A destructor call on this object is relatively simple, below we will implement the destructor for the object.

Color __del__Color(Color* self){
    if(self == NULL || *self == NULL)
	    return NULL;
	free(*self)
	return *self = NULL;
}

The destructor frees the allocated memory, making sure the pointer handed is a valid pointer. It also assigns the pointer to NULL making sure that no hanging pointers are left. It also returns NULL so that if someone trys to assign using the destructor it returns NULL so no hanging pointers are left.
Getters / accessors are all quite similar. Below I have provided one getter / accessor method.

int Color_get_r(Color self){
    if(self == NULL){
	    return -1;
	}
	return ((color*)(self))->r;
}

The function checks to see if self is a valid pointer before trying to access values related to the pointer. It then casts the void* to a color* to access a field in it. Mutators / Setters are similar to this. Below see a Mutator / Setter.

 void Color_set_r(Color self, int r){
    if(self == NULL){
	    return;
	}
	((color*)(self))->r = r;
}

Again checking to see if a pointer is valid is important

PART 10.1 Inheritance

Inheritance is when a object inherits the behavior of another objection. This means that you do not have to remake the function for said definition. This allows you to make more concies programs that allow for polymorphic functionality. This means that the program can have multiple types that work for the implementation. Generally this is done by containg the information within the inherited objects structure at the begging of the object. This information is called the base type. Because the data is at the beggining of the struct it means that the call to it will have all the superclass type info stored at thesame location as the inherited class itself. Due to this it inherits all the functions of the inherited class. Inheritance can be showed by drawing a tree from the base class to each inherting class. A good example of inheritance is often found in graphics librarys.

surface
| window
Box
| Button
| | CheckBox
| | RadioButton
| TextBox
| Rectangle
Renderable
| Button
| | CheckBox
| | RadioButton
| TextBox
| Rectangle
Action
| Button
| | CheckBox
| | RadioButton
| Field
| | TextBox

Above is a fictional inheritance tree. As you can see inheritance allows you to compose the functionality of objects in relation to each other.

PART 10.2 Interfaces

Interfaces allow for a programmer to define the api for a programs object without implementing the object, as each object that inherits such interface will have a different implementation. This can be done in a simple way. By attaching to the object a structure that contains those functions as funciton pointers. Function pointers provide the reference to a function so that the user can call the function within the program. How function pointers work is such type (*function_name)(parameter_types) = function; This allows you to define the functions and can call it directly from the object itself. In this case the object containg the function call should have the self or this value parameter type show said implementation through the struct. And the struct should be visable to the user.

PART 10 Questions

Create an API for calculating the interest on a bank account with the object Customer containing fields for customer_id,customer_age,account_age,interest_rate and any others you think of. Use valgrind to debug for any memory leaks.

PART 11 HEADERS AND PREPROCESSOR

Header files are a vital part to developing an API. Header files allow you to only expose parts of the API and hide the implmentation in a Compiled Object File. Compiled Object Files have the extension .o. To make a header file you must save as a .h to save the C code as a C header. To then make the implementation you create a .c file with all the definitions for the functions inside it. At the top include a preprocessor include as such #include "header_file_name.h".
The header file should only include declarations of functions. Its important as well to include at the top of the header file a header gaurd. This prevents the header from being imported twice and introducing conflicting declarations. See below for a example of a header gaurd.

#ifndef HEADER_NAME_H
#define HEADER_NAME_H
#endif //HEADER_NAME_H

To then compile your library you will want to use gcc -c library.c -o library.o. This compiles your implementation to a object file. You then can include the header in your main program, or your driver by using a preprocessor include. That include must be using quotations just like before. To then compile that driver to include the library, we can use gcc driver.c library.o -o program to compile and link. Linking is the action of taking compiled object files and translating them into the driver program so that the program can use the functions.
Preprocessor Macros are a fundamental feature of the C language. Preprocessor Macros allow you to concatenate arguments with the text, or use specific text if and only if a defined symbol exists using a #define tag. Concatenation is the act of combining two text elements together. To make a preprocessor text replacement macro, we can do such. Lets say we wanted to make it so that a variable name changes on the input of a macro. Below is how it is done

#define custom_variable_name(Type, Name)\
    Type variable_##Name;
custom_variable_name(int, yello)

The ## characters indicate concatenation of two strings. In this case its between argument Name and text variable_. The call to the macro means that it will create a variable of type int with name variable_yello.

PART 11 Questions

Take the assignment from last time and split it up into a driver, c implementation file, and a header file. Recompile the code using the method provided above.

PART 12 GENERICS

Generics are a way of defining the objects with data types that are not necessary to the implementation, only to the user. This means we can make a object with fields of type int and one of fields type double. It also means we can make functions that are not object functions where the data types only matter to the user. This is done with Preprocessor macros. We can use text replacement to implement a way of making a generic templated function. See below

#define MakeAdd(Type)\
    Type add_##Type(Type a, Type b){\
	    return a + b;\
	}
MakeAdd(int)
MakeAdd(double)

This allows us to make a generic function where it covers many implementations without having to repeat ourselves. If we need a set of objects that use int but vary by nothing but type compared to double, this is a good way of dealing with that.

PART 13 DATA STRUCTURES

Knowing what we know now we can implement different forms of data structures. Data structures are the ways we organize our data. Arrays are a data structure. However from this point on we are going to investigate different kinds of data structures and their adavantages. There are LinkedLists, ArrayList,Tree,HashMap,Queue,Stack and others. We are only going to investigate these ones here however. We will be using Big O Notation. Big O notation describes the asymptiotic upper bound of the time taken to run something. It is written as O(time_taken) and is expressed refering to the length of something.

PART 13.1 LinkedList

Linked Lists are way of storing data where the data only knows of what is ahead of it, and possibly what is before it. This is done by pointing to the adjacent Node's. Below would be a memory map of what happens.

Memory Map
     [ Node    ] Node    ] Node    ] Node    ] Node    ]head|
Addr |  0 |  1 |  2 |  3 |  4 |  5 |  6 |  7 |  8 |  9 | 10 | 
Value| 10 |  6 |  3 |  8 |  9 |  2 | 12 |NULL| 15 |  0 |  4 |

node structure:

struct Node{
    int value;
    struct Node* next;
};

If you notice here the head variable points to a Node which contains a pointer to a next Node, so on and so forth. This allows you to add elements to the list as your working on it and remove elements dynamically. To delete the linked list you have to go to the tail and delete it and work your way back. To add a element you simply create a new node and go to the tail of the linked list and point the tail to the new element. The main issue of Linked List is the time to access an abitrary element. Its O(n) which is linear. Compare that with ArrayList access time being O(1).

PART 13.2 ArrayList

ArrayList is a dynamically sizing array, allowing a person to access any element with ease and add or remove elements. It does this by reallocating after its capacity matches its size. It then procedes to double its allocation so that its capacity doubles. The size of a arraylist is how many elements are currently being stored. the capacity is how many it could store before a re-alloction. The main issue with ArrayList compared to LinkedList is the amount of time it takes to remove or add elements in the middle or begining. Its O(n). Compared to the LinkedList where its O(1).

PART 13.3 Tree

Trees are similar to linked lists but where each node points to two or more nodes. This allows us to store data in hierarchies. The main advantage to a tree is the insert time for a sorted tree is lower than both ArrayList and LinkedList. At O(lg(n))it is much lower. The disadvantage is when a tree becomes lopsided it takes a longer time to insert and requires more time to search.

PART 13.4 HashMap

HashMap is a form of a relational link between an element and a key. This allows us to have associativity between the key and element. This is done by relating the hash of the key to the elements index. This allows us to retrieve elements by key that is not a incremental integer at O(1). However it also has the possibility of hash collisions where two keys hashes line up at the capacity it has and needs to resolve it. There are multiple ways of resolving such conundrum. One way is by having a linked list, another is to rehas the hash. All ways end up with larger memory implementations.

PART 13.5 Queue

Queue is a data structure where a FIFO or First In First Out policy is implemented. This means that elements inserted first come out before later inserted elements. This is similar to the way queues work at restaraunts or stores. Under the hood, Queue is implemented in a similar way to LinkedList. The difference is you have a bi-directional linked list, and keep the head and tail. when you insert a element, you add them to the head. when you remove a element you remove from the tail. This way the Queue continues down the elements. The main advantage is that this both preserves order and is usable in the case of sycnronization. For example, when processes comunincate under many OS's, they use a Queue to communicate between each other to maintain order and data saftey. The operations for adding to a Queue is called enqueueing and removeing from a Queue is called dequeueing,

PART 13.6 Stack

Stack is a data structure where a FILO or First In Last Out policy is implemented. This operations are called popping when removing, and pushing when adding to the Stack. Stacks due to their nature allow for saving recursive actions, as the recursive actions will propogate up, then back down. For example how many PC's save the recursive function calls is through a stack, where it saves the varaibles, the return address on the stack in something called a Stack Frame. Stacks are also implemented like a LinkedList. The head when popping deletes itself and gets the next node. When pushing the new node becomes the head and points to the previous head.

PART 13 Questions

Implement generic templates of each of these data structures in C using preprocessor directives. NOTE: Generic Objects must have implementation stored inside the header file.

PART 14 FLOATING POINTS

Floating points are structured in a way where there are three parts. Floating points contain a Sign Bit, a Exponent, and a Mantissa. The sign bit determines if its negative or positive. The Mantissa is the fractional part of the number, and the exponent is what its brought to.
To understand floating points better, Below is how the structure of a single precision floating point.

single precision float table
sign exponent Mantissa
   1 01101111 00000000000000000000000
(-1)^sign * (Mantissa) * 2^(adjusted exponent)

We can follow these steps:

If the exponent is all 1s, and the mantissa is empty its +/- infinity
if the exponent is all 1s and the mantissa is not empty, then its NaN
If the exponent contains a 1, we prepend the Mantissa with a 'floating' one
If the exponent is empty then the adjusted exponent is 2^(exponent_size - 1) - 2
If the exponent is not empty then the adjusted exponent is exponent - (2^(exponent_size - 1) -1)
Then you calculate! (-1)^sign * Mantissa * 2^(adjusted exponent)

PART 15 MAKEFILE

Make is a powerful tool that allows you to automate building of large projects. The files that create these projects are called Makefiles. Makefiles allow you to define terminal sequences based off of dependencys and can resolve linking order. These defined sequences are called Make recipes. A simple makeifle is named Makefile and can run the recipe from the top by calling Make in the terminal. These make recipes can also be called individually by placing the recipe name after the Make command.
To make a Makefile and add recipes a simple template is below.

all: main.c linked_file_names.o
    gcc main.c linked_file_names.o -o program_name
linked_file_names.o: obect_file.cpp
    gcc object_file.cpp -c -o linked_file_names.o
clean:
    rm -rf ./*.o

The recipes are denoted by putting the name of the recipe followed by a colon (:). Everything following the colon is called a dependency. Dependencys can be resolved by the same named make recipe. If the file is available, then it wont call the make recipe, making the compilation and linking simpler. This also means that it will only recompile files that are changed. Makefiles also have other features. One of them is allowing you to hae variables, in which you can change to change all compilation settings. For example compile flags are often added to a makefile so that you can change how the program is compiled quickly and easily. Another example is also the compiler. If someone wanted to change the compiler, with a variable, simply changing the variables value will change the compiler used in all locations. To use variables simply create a variable name at the top of the file and assign the variable a value. Then where used simple type $(variable_name_here) and the variable gets inserted as text at that location. An example is provided below

compiler = gcc
flags = -O2 -werror -Wall -pedantic -Wfatal-errors
all: main.c linked_file_names.o
    $(compiler) $(flags) main.c linked_file_names.o -o program_name
linked_file_names.o: obect_file.cpp
    $(compiler) $(flags) object_file.cpp -c -o linked_file_names.o
clean:
    rm -rf ./*.o

Because of this ability it makes it incredibly powerful. However it can be even more powerful. Lets say you have a directory tree. This directory tree looks like this

Program
| main.c
--- Library
     | library.c
     | library_others.c
     | Makefile
| Makefile

Makefiles can be then used to recursively call itself down the directory tree. This also allows you to use something called wildcards. Wildcards allow you to match for a matching file name and simplify your Makefile. Make wildcards are denoted as a % folloing or precedeing the rest of the expression. An example expression would be as such:

compiler = gcc
flags = -O2 -werror -Wall -pedantic -Wfatal-errors
%.o: %.c
    $(compiler) $(flags) %.c -c -o %.o
clean:
    rm -rf *.o

Because of all of this, it allows you to compile all the files in the program without having to define every single make recipe for every file. And to call the directorys Makefile from inside the main build directory, you can simply call Make on it.

compiler = gcc
flags = -O2 -werror -Wall -pedantic -Wfatal-errors
main: main.c 
    make -C library
    $(compiler) $(flags) main.c ./library/*.o -o main

This both allows you to simplify your compilation process, but also allwos you to make the compilation simpler so that each library can be quickly compiled.

PART 15 Questions

1.) Convert your previous programs to use makefiles to compile. Recompile them using makefiles.

Adendum

For Windows PC's:

Head to Oracle's Virtualbox download and select Windows hosts. Install the VirtualBox-*.*.exe and accept all driver installs. This will install device drivers for the virtual machine. It will ask to reboot, in which you should.

For MacOS PC's:

Head to Oracle's Virtualbox download and select OS-X hosts. Install the package and accept all the permissions.

Setting up the Virtual Machine

Head to Canonical's Ubuntu download site and download the LTS version of Ubuntu for x86-64. Once the iso is finished downloading, open up VirtualBox and press the blue star indicating New. This will bring up the setup wizard. For the OS, select Ubuntu, when it asks for the install disc, select the location for your .iso file for Ubuntu, Otherwise press next. Finish setting up the Virtual Machine, and click the green arrow. If Virtualbox raises an error stating that the Virtual Machine is not able to start, you will need to turn on virtualization extensions in your BIOS. Search Google for your PC model followed by enable virtualization in BIOS and follow the instructions. Follow the onscreen prompts and setup. You should have a virtual machine setup!