ALT-Assembly Language Tutorial

Similar documents
About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer

ASSEMBLY - QUICK GUIDE ASSEMBLY - INTRODUCTION

Computer Organization and Assembly Language. Lab Session 01

Course Syllabus [1/2]

ECOM 2325 Computer Organization and Assembly Language. Instructor: Ruba A.Salamah INTRODUCTION

ORG ; TWO. Assembly Language Programming

Assembly basics CS 2XA3. Term I, 2017/18

CS401 - Computer Architecture and Assembly Language Programming Glossary By

Ethical Hacking. Assembly Language Tutorial

Practical Malware Analysis

Low-Level Essentials for Understanding Security Problems Aurélien Francillon

Name: CMSC 313 Fall 2001 Computer Organization & Assembly Language Programming Exam 1. Question Points I. /34 II. /30 III.

CHAPTER ASSEMBLY LANGUAGE PROGRAMMING

X86-NASM STANDARD COMMANDS. Comment your code with a semicolon (;)! The assembler won t read anything after it.

SOEN228, Winter Revision 1.2 Date: October 25,

Chapter Overview. Assembly Language for Intel-Based Computers, 4 th Edition. Chapter 1: Basic Concepts. Printing this Slide Show

T Jarkko Turkulainen, F-Secure Corporation

Machine and Assembly Language Principles

VARDHAMAN COLLEGE OF ENGINEERING (AUTONOMOUS) Shamshabad, Hyderabad

EECE416 :Microcomputer Fundamentals and Design. X86 Assembly Programming Part 1. Dr. Charles Kim

Microprocessor. By Mrs. R.P.Chaudhari Mrs.P.S.Patil

Assembly Language. Lecture 2 - x86 Processor Architecture. Ahmed Sallam

HUDSON VALLEY COMMUNITY COLLEGE TROY, NEW YORK COURSE OUTLINE

Basic Assembly SYSC-3006

mith College Computer Science CSC231 - Assembly Week #4 Dominique Thiébaut

Dr. Ramesh K. Karne Department of Computer and Information Sciences, Towson University, Towson, MD /12/2014 Slide 1

CS-202 Microprocessor and Assembly Language

COMPUTER ORGANIZATION & ARCHITECTURE

Assembly Language. Lecture 2 x86 Processor Architecture

We can study computer architectures by starting with the basic building blocks. Adders, decoders, multiplexors, flip-flops, registers,...

Basic Concepts COE 205. Computer Organization and Assembly Language Dr. Aiman El-Maleh

Computer Organization CS 206 T Lec# 2: Instruction Sets

Assembly Language Programming of 8085

Computer Architecture and Assembly Language. Practical Session 5

Chapter 3: Addressing Modes

CC411: Introduction To Microprocessors

Systems Programming. Fatih Kesgin &Yusuf Yaslan Istanbul Technical University Computer Engineering Department 18/10/2005

LAB WORK NO. 3 TURBO DEBUGGER ENVIRONMENT

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer. Compiler Design

CPU. IBM PC and compatible Memory Structure

Instruction Sets: Characteristics and Functions Addressing Modes

Assembly Language: g Part III. First Semester 2013 Department of Computer Science Faculty of Science Chiang Mai University

IBM PC Hardware CPU 8088, Pentium... ALU (Arithmetic and Logic Unit) Registers. CU (Control Unit) IP.

Mark II Aiken Relay Calculator

Assembly Language Fundamentals. Chapter 3

EEM336 Microprocessors I. Data Movement Instructions

EXPERIMENT 1. FAMILIARITY WITH DEBUG, x86 REGISTERS and MACHINE INSTRUCTIONS

When an instruction is initially read from memory it goes to the Instruction register.

8086 ALP TOOLS (CH 2) CHAPTER 2

reply db y prompt db Enter your favourite colour:, 0 colour db 80 dup(?) i db 20 k db? num dw 4000 large dd 50000

We briefly explain an instruction cycle now, before proceeding with the details of addressing modes.

COPYRIGHTED MATERIAL. What Is Assembly Language? Processor Instructions

US06CCSC04: Introduction to Microprocessors and Assembly Language UNIT 1: Assembly Language Terms & Directives

Assembly Language Programming of 8085

17. Instruction Sets: Characteristics and Functions

Summer 2003 Lecture 1 06/09/03

Microprocessor and Assembly Language Week-5. System Programming, BCS 6th, IBMS (2017)

Assembly Language Programming

Microcomputer Architecture..Second Year (Sem.2).Lecture(2) مدرس المادة : م. سندس العزاوي... قسم / الحاسبات

Topics. Hardware and Software. Introduction. Main Memory. The CPU 9/21/2014. Introduction to Computers and Programming

Intel X86 Assembler Instruction Set Opcode Table

CNIT 127: Exploit Development. Ch 1: Before you begin. Updated

EDIABAS BEST/2 LANGUAGE DESCRIPTION. VERSION 6b. Electronic Diagnostic Basic System EDIABAS - BEST/2 LANGUAGE DESCRIPTION

Intel 8086: Instruction Set

Microprocessors & Assembly Language Lab 1 (Introduction to 8086 Programming)

Chapter 3 (Part a) Assembly Language Fundamentals

3.1 DATA MOVEMENT INSTRUCTIONS 45

Chapter 3: Assembly Language Fundamentals. Cristina G. Rivera

Defining and Using Simple Data Types

Representation of Information

Module 3 Instruction Set Architecture (ISA)

EEM336 Microprocessors I. Arithmetic and Logic Instructions

Assembly Language for Intel-Based Computers, 5 th Edition. Kip Irvine. Chapter 3: Assembly Language Fundamentals

EEM336 Microprocessors I. The Microprocessor and Its Architecture

Computer Science and Engineering 331. Midterm Examination #1. Fall Name: Solutions S.S.#:

Assembly Language. Lecture 3 Assembly Fundamentals

System calls and assembler

UNIT I. Pune Vidyarthi Griha s COLLEGE OF ENGINEERING, NASHIK-4. 1

Lab 2: Introduction to Assembly Language Programming

Mr. Sapan Naik 1. Babu Madhav Institute of Information Technology, UTU

Constants and Expressions. Lecture 7: Assembly Language Programs. Constants and Expressions (cont.) Statements. Names. Assembly Directives


SRI VENKATESWARA COLLEGE OF ENGINEERING AND TECHNOLOGY DEPARTMENT OF ECE EC6504 MICROPROCESSOR AND MICROCONTROLLER (REGULATION 2013)

Computer Architecture and System Software Lecture 04: Floating Points & Intro to Assembly

Chapter Overview. Assembly Language for Intel-Based Computers, 4 th Edition. Chapter 3: Assembly Language Fundamentals.

Lecture 2 Assembly Language

Faculty of Engineering Computer Engineering Department Islamic University of Gaza Assembly Language Lab # 2 Assembly Language Fundamentals

CS Basics 5) Programming in Assembly Language

Tutorial Letter 103/3/2012 Computer Organization COS2621 Semesters 1 & 2

Language of x86 processor family

Computer Labs: Mixed C and Assembly Programming

Compiler construction. x86 architecture. This lecture. Lecture 6: Code generation for x86. x86: assembly for a real machine.

Modesto Junior College Course Outline of Record CMPSC 241

Computer Systems Lecture 9

PESIT Bangalore South Campus

Summary: Direct Code Generation

Instruction Set Architecture

INT 21H and INT 10H Programming and Macros

The x86 Microprocessors. Introduction. The 80x86 Microprocessors. 1.1 Assembly Language

CNIT 127: Exploit Development. Ch 3: Shellcode. Updated

Transcription:

ALT-Assembly Language Tutorial

ASSEMBLY LANGUAGE TUTORIAL Let s Learn in New Look SHAIK BILAL AHMED i

A B O U T T H E T U TO R I A L Assembly Programming Tutorial Assembly language is a low-level programming language for a computer, or other programmable device specific to a particular computer architecture in contrast to most highlevel programming languages, which are generally portable across multiple systems. Assembly language is converted into executable machine code by a utility program referred to as an assembler like NASM, MASM etc. Audience This tutorial has been designed for software programmers with a need to understand the Assembly programming language starting from scratch. This tutorial will give you enough understanding on Assembly programming language from where you can take yourself at higher level of expertise. Prerequisites Before proceeding with this tutorial you should have a basic understanding of Computer Programming terminologies. A basic understanding of any of the programming languages will help you in understanding the Assembly programming concepts and move fast on the learning track.

Copyright & Disclaimer Notice All the content and graphics on this tutorial are the property of Bilal Ahmed Shaik. Any content from Bilal Ahmed Shaik or this tutorial may not be redistributed or reproduced in any way, shape, or form without the written permission of Bilal Ahmed Shaik. Failure to do so is a violation of copyright laws. This tutorial may contain inaccuracies or errors and Bilal Ahmed Shaik provides no guarantee regarding the accuracy of the site or its contents including this tutorial. If you discover that the Bilal Ahmed Shaik or this tutorial content contains some errors, please contact us at shaikbilalahmed@sify.com

Table of Content Assembly Programming Tutorial... 2 Audience... 2 Prerequisites... 2 Copyright & Disclaimer Notice... 3 Assembly Introduction... 8 What is Assembly Language?... 8 Advantages of Assembly Language...8 Basic Features of PC Hardware...9 The Binary Number System...9 The Hexadecimal Number System...9 Binary Arithmetic...10 Addressing Data in Memory...11 Assembly Environment Setup... 13 Installing NASM...13 Assembly Basic Syntax... 15 The data Section...15 The bss Section...15 The text section...15 Comments...15 Assembly Language Statements...16 Syntax of Assembly Language Statements...16 The Hello World Program in Assembly...16 Compiling and Linking an Assembly Program in NASM...17 Assembly Memory Segments... 18 Memory Segments...18 Assembly Registers... 20 Processor Registers...20 Data Registers...20 Pointer Registers...21 Index Registers...21 Control Registers...22 Segment Registers...22 Example:...23 Assembly System Calls... 24 Linux System Calls...24 Example...25 Addressing Modes... 27

Register Addressing...27 Immediate Addressing...27 Direct Memory Addressing...28 Direct-Offset Addressing...28 Indirect Memory Addressing...28 The MOV Instruction...28 SYNTAX:...28 EXAMPLE:......... 29 Assembly Variables... 31 Allocating Storage Space for Initialized Data...31 Allocating Storage Space for Uninitialized Data...32 Multiple Definitions...32 Multiple Initializations...33 Assembly Constants... 34 The EQU Directive...34 Example:...34 The %assign Directive...35 The %define Directive...35 Arithmetic Instructions... 37 SY NT A X :........... 37 EXAMPLE:......... 37 The DEC Instruction...37 SY NT A X :........... 37 EXAMPLE:......... 37 The ADD and SUB Instructions...38 SYNTAX:...38 EXAMPLE:...38 The MUL/IMUL Instruction...40 SYNTAX:...40 EXAMPLE:...41 EXAMPLE:......... 41 The DIV/IDIV Instructions...42 SY NT A X :........... 42 EXAMPLE:......... 43 Logical Instructions... 45 The AND Instruction...45 Example:...46 The OR Instruction...46 Example:...47

The XOR Instruction...47 The TEST Instruction...48 The NOT Instruction...48 Assembly Conditions... 49 The CMP Instruction...49 SYNTAX...49 EXAMPLE:...49 Unconditional Jump...50 SYNTAX:...50 EXAMPLE:...50 Conditional Jump...50 Example:...51 Assembly Loops... 53 Example:...53 Assembly Numbers... 55 ASCII Representation...56 BCD Representation...57 Example:...57 Assembly Strings... 59 String Instructions...59 MOVS...60 LODS...61 CMPS...62 SCAS...63 Repetition Prefixes...64 Assembly Arrays... 65 Example:...66 Assembly Procedures... 67 Syntax:...67 Example:...67 Stacks Data Structure:...68 EXAMPLE:......... 69 Assembly Recursion... 70 Assembly Macros... 72 Example:...73 Assembly File Management... 74 File Descriptor...74 File Pointer...74 File Handling System Calls...74

Creating and Opening a File...75 Opening an Existing File...75 Reading from a File...75 Writing to a File...76 Closing a File...76 Updating a File...76 Example:...77 Memory Management... 79 Example:...79

CHAPTER 1 Assembly Introduction What is Assembly Language? Each personal computer has a microprocessor that manages the computer's arithmetical, logical and control activities. Each family of processors has its own set of instructions for handling various operations like getting input from keyboard, displaying information on screen and performing various other jobs. These set of instructions are called 'machine language instruction'. Processor understands only machine language instructions which are strings of 1s and 0s. However machine language is too obscure and complex for using in software development. So the low level assembly language is designed for a specific family of processors that represents various instructions in sym bolic code and a more understandable form. Advantages of Assembly Language An understanding of assembly language provides knowledge of: Interface of programs with OS, processor and BIOS; Representation of data in memory and other external devices; How processor accesses and executes instruction; How instructions accesses and process data; How a program access external devices. Other advantages of using assembly language are: It requires less memory and execution time; It allows hardware-specific complex jobs in an easier way; It is suitable for time-critical jobs;

It is most suitable for writing interrupt service routines and other memory resident programs. Basic Features of PC Hardware The main internal hardware of a PC consists of the processor, memory and the registers. The registers are processor components that hold data and address. To execute a program the system copies it from the external device into the internal memory. The processor executes the program instructions. The fundamental unit of computer storage is a bit; it could be on (1) or off (0). A group of nine related bits makes a byte. Eight bits are used for data and the last one is used for parity. According to the rule of parity, number of bits that are on (1) in each byte should always be odd. So the parity bit is used to make the number of bits in a byte odd. If the parity is even, the system assumes that there had been a parity error (though rare) which might have caused due to hardware fault or electrical disturbance. The processor supports the following data sizes: Word: a 2-byte data item Doubleword: a 4-byte (32 bit) data item Quadword: an 8-byte (64 bit) data item Paragraph: a 16-byte (128 bit) area Kilobyte: 1024 bytes Megabyte: 1,048,576 bytes The Binary Number System Every number system uses positional notation i.e., each position in which a digit is written has a different positional value. Each position is power of the base, which is 2 for binary number system, and these powers begin at 0 and increase by 1. The following table shows the positional values for an 8-bit binary number, where all bits are set on. Bit value 1 1 1 1 1 1 1 1 Position value as a power of base 2 128 64 32 16 8 4 2 1 Bit number 7 6 5 4 3 2 1 0 The value of a binary number is based on the presence of 1 bits and their positional value. So the value of the given binary number is: 1 + 2 + 4 + 8 +16 + 32 + 64 + 128 = 255, which is same as 2 8-1. The Hexadecimal Number System Hexadecimal number system uses base 16. The digits range from 0 to 15. By convention, the letters A through F is used to represent the hexadecimal digits corresponding to decimal values 10 through 15.

Main use of hexadecimal numbers in computing is for abbreviating lengthy binary representations. Basically hexadecimal number system represents a binary data by dividing each byte in half and expressing the value of each half-byte. The following table provides the decimal, binary and hexadecimal equivalents: Decimal number Binary representation Hexadecimal representation 0 0 0 1 1 1 2 10 2 3 11 3 4 100 4 5 101 5 6 110 6 7 111 7 8 1000 8 9 1001 9 10 1010 A 11 1011 B 12 1100 C 13 1101 D 14 1110 E 15 1111 F To convert a binary number to its hexadecimal equivalent, break it into groups of 4 consecutive groups each, starting from the right, and write those groups over the corresponding digits of the hexadecimal number. Example: Binary number 1000 1100 1101 0001 is equivalent to hexadecimal - 8CD1 To convert a hexadecimal number to binary just write each hexadecimal digit into its 4-digit binary equivalent. Example: Hexadecimal number FAD8 is equivalent to binary - 1111 1010 1101 1000 Binary Arithmetic The following table illustrates four simple rules for binary addition: (i) (ii) (iii) (iv) 0 1 1 1 +0 +0 +1 +1 =0 =1 =10 =11 Rules (iii) and (iv) shows a carry of a 1-bit into the next left position. Example : 1

Decimal Binary 60 00111100 +42 00101010 102 01100110 A negative binary value is expressed in two's complement notation. According to this rule, to convert a binary number to its negative value is to reverse its bit values and add 1. Example : Number 53 00110101 Reverse the bits 11001010 Add 1 1 Number -53 11001011 To subtract one value from another, convert the number being subtracted to two's complement format and add the numbers. Example : Subtract 42 from 53 Number 53 00110101 Number 42 00101010 Reverse the bits of 42 11010101 Add 1 1 Number -42 11010110 53-42 = 11 00001011 Overflow of the last 1 bit is lost. Addressing Data in Memory The process through which the processor controls the execution of instructions is referred as the fetch -decodeexecute cycle, or the execution cycle. It consists of three continuous steps: Fetching the instruction from memory Decoding or identifying the instruction Executing the instruction The processor may access one or more bytes of memory at a time. Let us consider a hexadecimal number 0725H. This number will require two bytes of memory. The high-order byte or most significant byte is 07 and the low order byte is 25. The processor stores data in reverse-byte sequence i.e., the low-order byte is stored in low memory address and high-order byte in high memory address. So if processor brings the value 0725H from register to memory, it will transfer 25 first to the lower memory address and 07 to the next memory address.

x: memory address When the processor gets the numeric data from memory to register, it again reverses the bytes. There are two kinds of memory addresses: An absolute address - a direct reference of specific location. The segment address (or offset) - starting address of a memory segment with the offset value

CHAPTER 2 Assembly Environment Setup Assembly language is dependent upon the instruction set and the architecture of the processor. In this tutorial, we focus on Intel 32 processors like Pentium. To follow this tutorial, you will need: An IBM PC or any equivalent compatible computer A copy of Linux operating system A copy of NASM assembler program There are many good assembler programs, like: Microsoft Assembler (MASM) Borland Turbo Assembler (TASM) The GNU assembler (GAS) We will use the NASM assembler, as it is: Free. You can download it from various web sources. Well documented and you will get lots of information on net. Could be used on both Linux and Windows Installing NASM If you select "Development Tools" while installed Linux, you may NASM installed along with the Linux operating system and you do not need to download and install it separately. For checking whether you already have NASM installed, take the following steps: Open a Linux terminal. Type whereis nasm and press ENTER. If it is already installed then a line like, nasm: /usr/bin/nasm appears. Otherwise, you will see justnasm:, then you need to install NASM. To install NASM take the following steps:

Check The netwide assembler (NASM) website for the latest version. Download the Linux source archive nasm-x.xx. ta.gz, where X.XX is the NASM version number in the archive. Unpack the archive into a directory, which creates a subdirectory nasm-x. XX. cd to nasm-x. XX and type./configure. This shell script will find the best C compiler to use and set up Makefiles accordingly. Type make to build the nasm and ndisasm binaries. Type make install to install nasm and ndisasm in /usr/local/bin and to install the man pages. This should install NASM on your system. Alternatively, you can use an RPM distribution for the Fedora Linux. This version is simpler to install, just double-click the RPM file.

CHAPTER 3 Assembly Basic Syntax An assembly program can be divided into three sections: The data section The bss section The text section The data Section The data section is used for declaring initialized data or constants. This data does not change at runtime. You can declare various constant values, file names or buffer size etc. in this section. The syntax for declaring data section is: section.data The bss Section The bss section is used for declaring variables. The syntax for declaring bss section is: section.bss The text section The text section is used for keeping the actual code. This section must begin with the declarationglobal main, which tells the kernel where the program execution begins. The syntax for declaring text section is: section.text global main main: Comments Assembly language comment begins with a semicolon (;). It may contain any printable character including blank. It can appear on a line by itself, like:

; This program displays a message on screen or, on the same line along with an instruction, like: add eax,ebx ; adds ebx to eax Assembly Language Statements Assembly language programs consist of three types of statements: Executable instructions or instructions Assembler directives or pseudo-ops Macros The executable instructions or simply instructions tell the processor what to do. Each instruction consists of an operation code (opcode). Each executable instruction generates one machine language instruction. The assembler directives or pseudo-ops tell the assembler about the various aspects of the assembly process. These are non-executable and do not generate machine language instructions. Macros are basically a text substitution mechanism. Syntax of Assembly Language Statements Assembly language statements are entered one statement per line. Each statement follows the following format: [label] mnemonic [operands] [;comment] The fields in the square brackets are optional. A basic instruction has two parts, the first one is the name of the instruction (or the mnemonic) which is to be executed, and the second are the operands or the parameters of the command. Following are some examples of typical assembly language statements: INC COUNT MOV TOTAL, 48 ADD AH, BH AND MASK1, 128 ADD MARKS, 10 MOV AL, 10 ; Increment the memory variable COUNT ; Transfer the value 48 in the ; memory variable TOTAL ; Add the content of the ; BH register into the AH register ; Perform AND operation on the ; variable MASK1 and 128 ; Add 10 to the variable MARKS ; Transfer the value 10 to the AL register The Hello World Program in Assembly The following assembly language code displays the string 'Hello World' on the screen: section.text global main main: mov edx,len mov ecx,msg mov ebx,1 mov eax,4 int 0x80 ;must be declared for linker (ld) ;tells linker entry point ;message length ;message to write ;file descriptor (stdout) ;system call number (sys_write) ;call kernel

Thank You for previewing this ebook You can read the full version of this ebook in different formats: HTML (Free /Available to everyone) PDF / TXT (Available to V.I.P. members. Free Standard members can access up to 5 PDF/TXT ebooks per month each month) Epub & Mobipocket (Exclusive to V.I.P. members) To download this full book, simply select the format you desire below