0% found this document useful (0 votes)

39 views

7.x86 - Architecture

The document provides information about the basics of x86 architecture. It discusses: 1) The key components of x86 processors including integer and floating point registers, memory management unit, and support for multicore processors. 2) The 8086 processor which had 16-bit registers and used segment registers to extend the physical address space to 1MB. 3) The 80386 processor which introduced 32-bit registers and addressing while still supporting 16-bit mode for compatibility. It expanded the address space and added protection rings.

Uploaded by

yogeshwari bahiram

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views

7.x86 - Architecture

Uploaded by

yogeshwari bahiram

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 94

Basics of X86 architecture

Abhijit A M
[email protected]

Credits: Notes by Prof. Sorav Bansal, https://www.felixcloutier.com/x86/, Intel Documentation

Any processor has

integer registers and their execution unit

floating-point/vector registers and execution
unit(s)

memory management unit (MMU)

multiprocessor/multicore: local interrupt
controller (APIC)

etc
Segmentation
Compiler’s view of the program
Segmentation
Address Translation
8086: 16 bit CPU
(precursor to x86 family)

https://jerry153fish.github.io/2016/01/01/8086-Registers.html
8086 Registers

16 bit ought to be enough ! (?)

General purpose registers

four 16-bit data registers: AX, BX, CX, DX

each in two 8-bit halves, e.g. AH and AL

Registers for memory addressing

SP, BP, SI, DI : 16 bit

SP stack pointer, BP base pointer, SI Source Index, DI
Destination Index

IP instruction Pointer

Addressible memory: 2^16 = 64 kb
8086 address extension

8086 has 20-bit physical addresses ==> 1 MB AS

the extra four bits usually come from a 16-bit "segment
register":

CS - code segment, for fetches via IP

SS - stack segment, for load/store via SP and BP

DS - data segment, for load/store via other registers

ES - another data segment, destination for string operations

%cs:%ip full address: %cs * 16 + %ip is actual address that
goes on bus. virtual to physical translation is

pa = va + seg*16

e.g. set CS = 4096 to execute starting at 65536
8086 address extension

Extending ‘address space’ with the help of
MMU

FLAGS register

FLAGS - various condition codes: whether last
arithmetic operation

overflowed

was positive/negative

was [not] zero

carry/borrow on add/subtract

etc.

whether interrupts are enabled

direction of data copy instructions

Uses in JP, JN, J[N]Z, J[N]C, J[N]O ...
32 bit 80386

boots in 16-bit mode, then on running particular
instructions switches to 32-bit mode (protected
mode)

registers are 32 bits wide, called EAX rather than AX

EAX EBX ECX EDX

ESP EBP ESI EDI

operands and addresses 32-bit in 32-bit mode,

e.g. ADD does 32-bit arithmetic

Segment registers are 16 bit: CS, SS, DS, etc.
32 bit 80386

Still possible to access 16 bit registers using AX or BX

Specifix coding of machine instructions to tell whether
operands are 16 or 32 bit

prefixes 0x66/0x67 toggle between 16-bit and 32-bit
operands/addresses respectively

in 32-bit mode, MOVW is expressed as 0x66 MOVW

the .code32 in bootasm.S tells assembler to generate 0x66
for e.g. MOVW

80386 also changed segments and added paged
memory...
Summary of registers in 80386
(32 bit)

General registers

32 bits : EAX EBX ECX EDX

16 bits : AX BX CX DX

8 bits : AH AL BH BL CH CL DH DL
Summary of registers in 80386

Segment Registers

CS: Holds the Code segment in which your
program runs.

DS : Holds the Data segment that your program
accesses.

ES,FS,GS : These are extra segment registers

SS : Holds the Stack segment your program
uses.

All 16 bit
Summary of registers in 80386

For a typical register, the 
SS:EBP
corresponding segment is 
Stack Segment: EBP ==>
used. Pairs of Indexes & mov (%ebp), %eax ==>
pointers (Segment & for accessing (%ebp) SS:
Registers) EBP address will be used

CS:EIP

DS:ESI , ES: EDI .

Code Segment: Index Pointer 
ESI: Extended Source

E.g. mov $32, %eax ==> The Index, EDI: Extended
code of move instruction Destination Index
uses CS: EIP

SS:ESP

The EFLAGS register

Stack Segment: ESP for flags

E.g. push $32 ==> The $32
will be pushed on stack.
Using SS: ESP address
X86 Assembly Code

Syntax

Intel syntax: op dst, src (Intel manuals!)

AT&T (gcc/gas) syntax: op src, dst (xv6)

uses b, w, l suffix on instructions to specify size
of operands

Operands are registers, constant, memory
via register, memory via constant
Examples of X86 instructions
AT&T syntax "C"-ish equivalent Operands
movl %eax, %edx edx = eax; register mode
movl $0x123, %edx edx = 0x123; immediate
movl 0x123, %edx edx = direct
*(int32_t*)0x123;
movl (%ebx), %edx edx = *(int32_t*)ebx; indirect
movl 4(%ebx), %edx edx = *(int32_t*) displaced
(ebx+4)
Instructions suffix/prefix

mov %eax, %ebx # 32 bit data

movw %ax, %bx # move 16 bit data

mov %ax, %bx # ax is 16 bit, so equivalent
to movw

mov $123, 0x123 # Ambigious

movw $123, 0x123 # correct, move 16 bit
data

Types of Instructions

data movement 
Control

MOV, PUSH, POP, ... 
JMP, JZ, JNZ, CALL,
RET

Arithmetic

TEST, SHL, ADD,

String
AND, ... 
REP MOVSB, ...

i/o 
System

IN, OUT, ... 
IRET, INT
Stack, given by %esp

Grows downwards

pushl %eax

Semantics are: substract 4 (32 bit value = 4 bytes) from esp
(grow downwards), and mov eax to memory pointed by esp, so
sub $4, %esp # esp = esp -4
mov %eax, (%esp) # *esp = eax

pop %eax

has opposite semantics, that is
movl (%esp), %eax
addl $4, %esp
Function calls using stack:
Approximate picture

main() { ... a(); ...} a(){ ... b();... c(); ...}

Each “frame’ contains the local variables for that function

GCC calling convention
int add(int x, int y) { 
Note: single set of
int z; registes for ENTIRE
z = x + y; program!
return z; 
For this code to work,
} questions in front of
compiler
int main() { 
Variables m and n have to
int m, n, sum, prod; be passed to add() - using
scanf("%d%d", &m, ‘pass by value’
&n); 
They are passed on stack –
sum = add(m, n); but how?

How to split the access of
general purpose registers?

add() should not modify
registers in access by main()
Picture of stack frame growth
GCC Calling 
after ret instruction:
%eip contains return
convention


address
Contract between caller 
%esp points at
(function) and callee arguments pushed by
(called function) caller

called function may have

at entry to a function trashed formal
(i.e. just after call): arguments

%eip points at first

%eax (and %edx, if
instruction of function return type is 64-bit)
contains return value (or

%esp+4 (upwards, trash if function is void)
reverse direction!) 
%eax, %edx (above), and
points at first %ecx may be trashed
argument 
%ebp, %ebx, %esi, %edi

%esp points at return must contain contents
address from time of call
“caller” save and “callee” save
registers

Terminology:

%eax, %ecx, %edx are "caller save" registers

%ebp, %ebx, %esi, %edi are "callee save" registers

Means, if main()->add() then code of add()
function (callee) can

freely use eax, ecx, edx and

If it uses, ebp, ebx, etc. then

it must save them before using and restore them before
returning
GCC does more than that !

function prologue: 
function epilogue

pushl %ebp 
movl %ebp, %esp

movl %esp, %ebp 
popl %ebp

(ret)

each function has a stack frame marked by %ebp, %esp

%esp can move to make stack frame bigger, smaller

%ebp points at saved %ebp from previous function,
chain to walk stack
Stack Frame
ebp, for debugging

The frame pointer (in ebp) is not strictly

Compiler can compute the address of its return address and
function arguments based on its knowledge of the current
depth of the stack pointer (in esp).

The frame pointer is useful for debugging purposes

Current function is based on the current value of eip.

Current value of *(ebp+4) provides the return address of the caller.

Current value of *((*ebp) + 4) (where *ebp contains the saved ebp
of the caller) provides the return address of the caller's caller

Current value of *(*(*ebp) + 4) provides the return address of the
caller's caller's caller, and so on . . .
Let’s see a demo of how the stack is built and destroyed during function calls, on a
Linux machine using GCC.

Consider this C code

int mult(int a, int b) { mult:
int c, d = 20, e = 30, f; pushl %ebp
f = add(d, e); movl %esp, %ebp
c = a * b + f; subl $24, %esp
return c; movl $20, -24(%ebp)
} movl $30, -20(%ebp)
int add(int x, int y) { subl $8, %esp
int z; pushl -20(%ebp)
z = x + y; pushl -24(%ebp)
return z; call add
} addl $16, %esp
movl %eax, -16(%ebp)
Translated to assembly as: movl 8(%ebp), %eax
add: imull 12(%ebp), %eax
pushl %ebp movl %eax, %edx
movl %esp, %ebp movl -16(%ebp), %eax
subl $16, %esp addl %edx, %eax
movl 8(%ebp), %edx movl %eax, -12(%ebp)
movl 12(%ebp), %eax movl -12(%ebp), %eax
addl %edx, %eax leave
movl %eax, -4(%ebp) ret
movl -4(%ebp), %eax
leave
ret
/* Control is here */
int mult(int a, int b) {
Stack int c, d = 20, e = 30, f;
f = add(d, e);
c = a * b + f;
return c;
}
X int add(int x, int y) {
b int z;
a ebp z = x + y;
RetAddr main()
return z;
}

esp * Control is here */

mult:
pushl %ebp
movl %esp, %ebp
subl $24, %esp
movl $20, -24(%ebp)
movl $30, -20(%ebp)
subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
int mult(int a, int b) {
int c, d = 20, e = 30, f;
Stack f = add(d, e);
c = a * b + f;
return c;
}
int add(int x, int y) {
X int z;
b z = x + y;
a return z;
RetAddr main()
}

X (prev ebp)
Y
ebp
esp
mult:
pushl %ebp
movl %esp, %ebp
subl $24, %esp
movl $20, -24(%ebp)
movl $30, -20(%ebp)
subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
int mult(int a, int b) {
int c, d = 20, e = 30, f;
Stack f = add(d, e);
c = a * b + f;
return c;
}
int add(int x, int y) {
X int z;
b z = x + y;
a return z;
RetAddr main()
}

X (prev ebp)
Y

ebp
mult:
esp pushl %ebp
movl %esp, %ebp
subl $24, %esp
movl $20, -24(%ebp)
movl $30, -20(%ebp)
subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
int mult(int a, int b) {
int c, d = 20, e = 30, f;
Stack f = add(d, e);
c = a * b + f;
return c;
}
int add(int x, int y) {
X int z;
b z = x + y;
a return z;
RetAddr main()
}

X (prev ebp)
Y

ebp
mult:
pushl %ebp
movl %esp, %ebp
subl $24, %esp
movl $20, -24(%ebp)
movl $30, -20(%ebp)
esp subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
int mult(int a, int b) {
int c, d = 20, e = 30, f;
Stack f = add(d, e);
c = a * b + f;
return c;
}
int add(int x, int y) {
X int z;
b z = x + y;
a return z;
RetAddr main()
}

X (prev ebp)
Y

ebp
mult:
pushl %ebp
movl %esp, %ebp
e = 30 subl $24, %esp
d = 20 movl $20, -24(%ebp)
movl $30, -20(%ebp)
subl $8, %esp
esp pushl -20(%ebp)
pushl -24(%ebp)
call add
int mult(int a, int b) {
int c, d = 20, e = 30, f;
Stack f = add(d, e);
c = a * b + f;
return c;
}
int add(int x, int y) {
X int z;
b z = x + y;
a return z;
RetAddr main()
}

X (prev ebp)
Y

ebp
mult:
pushl %ebp
movl %esp, %ebp
e = 30 subl $24, %esp
movl $20, -24(%ebp)
d = 20
movl $30, -20(%ebp)
subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
esp
Stack
X int mult(int a, int b) {
b int c, d = 20, e = 30, f;
a f = add(d, e);
c = a * b + f;
RetAddr main()
return c;
X (prev ebp) }
Y int add(int x, int y) {
int z;
ebp z = x + y;
return z;
}

e = 30
d = 20

mult:
pushl %ebp
30 (y) movl %esp, %ebp
20 (x) subl $24, %esp
movl $20, -24(%ebp)
movl $30, -20(%ebp)
esp subl $8, %esp
pushl -20(%ebp)
pushl -24(%ebp)
call add
Stack
X int mult(int a, int b) {
b int c, d = 20, e = 30, f;
a f = add(d, e);
c = a * b + f;
RetAddr main()
return c;
X (prev ebp) }
Y int add(int x, int y) {
int z;
ebp z = x + y;
return z;
}

e = 30
d = 20

mult:
pushl %ebp
30 (y) movl %esp, %ebp
20 (x) subl $24, %esp
movl $20, -24(%ebp)
Retadd mult() movl $30, -20(%ebp)
subl $8, %esp
pushl -20(%ebp)
esp
pushl -24(%ebp)
call add
Stack X b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20
add:
pushl %ebp
movl %esp, %ebp
subl $16, %esp
30 (y) movl 8(%ebp), %edx
20 (x) movl 12(%ebp), %eax
addl %edx, %eax
Retadd mult()
movl %eax, -4(%ebp)
Y(prev ebp) movl -4(%ebp), %eax
leave
ret
ebp

esp
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20
edx = 20 add:
eax = 30 pushl %ebp
eax = eax + edx = 50 movl %esp, %ebp
subl $16, %esp
30 (y) movl 8(%ebp), %edx
20 (x) movl 12(%ebp), %eax
addl %edx, %eax
Retadd mult()
movl %eax, -4(%ebp)
Y(prev ebp) movl -4(%ebp), %eax
leave
ret
ebp

esp
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20
eax = 50 add:
pushl %ebp
movl %esp, %ebp
subl $16, %esp
30 (y) movl 8(%ebp), %edx
20 (x) movl 12(%ebp), %eax
addl %edx, %eax
Retadd mult()
movl %eax, -4(%ebp)
Y(prev ebp) movl -4(%ebp), %eax
50 leave
ret
ebp Some redundant code
generated here. Before
esp “leave”. Result is in eax
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) leave: step 1 return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20
eax = 50 add:
pushl %ebp
movl %esp, %ebp
subl $16, %esp
30 (y) movl 8(%ebp), %edx
20 (x) movl 12(%ebp), %eax
addl %edx, %eax
Retadd mult()
movl %eax, -4(%ebp)
Y(prev ebp) movl -4(%ebp), %eax
50 leave # # Set ESP to
EBP, then pop EBP.
ebp ret

esp
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) leave: step 2 return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20 eax = 50 add:
pushl %ebp
movl %esp, %ebp
subl $16, %esp
30 (y) movl 8(%ebp), %edx
20 (x) movl 12(%ebp), %eax
addl %edx, %eax
Retadd mult()
movl %eax, -4(%ebp)
Y(prev ebp) movl -4(%ebp), %eax
50 leave # # Set ESP to
EBP, then pop EBP.
ebp ret

esp
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e); // here
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
return z;
}
e = 30
d = 20 eax = 50 Mult:
....
call add
addl $16, %esp
30 (y) movl %eax, -16(%ebp)
20 (x) movl 8(%ebp), %eax
imull 12(%ebp), %eax
Retadd mult()
movl %eax, %edx
Y(prev ebp) movl -16(%ebp), %eax
50 addl %edx, %eax
movl %eax, -12(%ebp)
ebp movl -12(%ebp), %eax
leave
esp ret
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
f = 50 (eax) return z;
}
e = 30
d = 20 eax = 50 Mult:
....
call add
addl $16, %esp
30 (y) movl %eax, -16(%ebp)
20 (x) movl 8(%ebp), %eax
imull 12(%ebp), %eax
Retadd mult()
movl %eax, %edx
Y(prev ebp) movl -16(%ebp), %eax
50 addl %edx, %eax
movl %eax, -12(%ebp)
ebp movl -12(%ebp), %eax
leave
esp ret
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
z = x + y;
f = 50 return z;
}
e = 30
d = 20 eax = a Mult:
eax = eax * b ....
edx = eax call add
eax = f addl $16, %esp
30 (y) eax = edx + eax movl %eax, -16(%ebp)
20 (x) // eax = a*b + f movl 8(%ebp), %eax
imull 12(%ebp), %eax
Retadd mult()
movl %eax, %edx
Y(prev ebp) movl -16(%ebp), %eax
50 addl %edx, %eax
movl %eax, -12(%ebp)
ebp movl -12(%ebp), %eax
leave
esp ret
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
c = eax z = x + y;
f = 50 return z;
}
e = 30
d = 20 // eax = a*b + f Mult:
....
call add
addl $16, %esp
30 (y) movl %eax, -16(%ebp)
Again
20 (x) some movl 8(%ebp), %eax
redundant imull 12(%ebp), %eax
Retadd mult()
code movl %eax, %edx
Y(prev ebp) movl -16(%ebp), %eax
50 addl %edx, %eax
movl %eax, -12(%ebp)
ebp movl -12(%ebp), %eax
leave
esp ret
Stack X
b
int mult(int a, int b) {
a int c, d = 20, e = 30, f;
RetAddr main() f = add(d, e);
c = a * b + f;
X (prev ebp) return c;
Y
}
int add(int x, int y) {
int z;
c = eax z = x + y;
f = 50 return z;
}
e = 30
d = 20 After leave Mult:
// eax = a*b + f ....
call add
addl $16, %esp
30 (y) movl %eax, -16(%ebp)
20 (x) movl 8(%ebp), %eax
imull 12(%ebp), %eax
Retadd mult()
movl %eax, %edx
Y(prev ebp) movl -16(%ebp), %eax
50 addl %edx, %eax
movl %eax, -12(%ebp)
ebp movl -12(%ebp), %eax
leave
esp ret
Lessons

Calling function (caller)

Pushes arguments on stack , copies values

On call

Return IP is pushed

Initially in called function (callee)

Old ebp is pushed

ebp = stack

Stack is decremented to make space for local
variables
Lessons

Before Return

Ensure that result is in ‘eax

On Return

stack = ebp

Pop ebp (ebp = old ebp)

On ‘ret’

Pop ‘return IP’ and go back in old function
Lessons

This was a demonstration for a

User program, compiled with GCC, On Linux

Followed the conventions we discussed earlier

Applicable to

C programs which work using LIFO function calls

Compiler can’t generate code using this
mechanism for

Functions like fork(), exec(), scheduler(), etc.

Boot code of OS
Memory Management
X86 address : protected mode
address translation
X86 paging
Segmentation + Paging

Selector value is
implicit based on
address being
accessed.
Instruction: CS
Stack Variable: SS
Data: DS
etc.
GDT Entry
Page Directory Entry (PDE)
Page Table Entry (PTE)
Segment selector

EFLAGS register
CR0

PG: Paging enabled or not WP: Write protecion on/off

PE: Protection Enabled --> protected mode.

CR2
CR3

CR4
mmu.h : paging related macros
#define PTXSHIFT 12 // offset of PTX in a linear address
#define PDXSHIFT 22 // offset of PDX in a linear address
#define PDX(va) (((uint)(va) >> PDXSHIFT) & 0x3FF)// page
directory index
#define PTX(va) (((uint)(va) >> PTXSHIFT) & 0x3FF)// page table
index
// construct virtual address from indexes and offset
#define PGADDR(d, t, o) ((uint)((d) << PDXSHIFT | (t) << PTXSHIFT |
(o)))
// +--------10------+-------10-------+---------12----------+
// | Page Directory | Page Table | Offset within Page |
// | Index | Index | |
// +----------------+----------------+---------------------+
// \--- PDX(va) --/ \--- PTX(va) --/
mmu.h : paging related macros
// Page directory and page table constants.
#define NPDENTRIES 1024 // #
directory entries per page directory
#define NPTENTRIES 1024 // # PTEs
per page table
#define PGSIZE 4096 // bytes
mapped by a page
#define PGROUNDUP(sz) (((sz)+PGSIZE-1) &
~(PGSIZE-1))
#define PGROUNDDOWN(a) (((a)) & ~(PGSIZE-1))
mmu.h : paging related macros
// Page table/directory entry flags.
#define PTE_P 0x001 // Present
#define PTE_W 0x002 // Writeable
#define PTE_U 0x004 // User
#define PTE_PS 0x080 // Page Size

// Address in page table or page directory entry

#define PTE_ADDR(pte) ((uint)(pte) & ~0xFFF) //
get all but last 12 bits
#define PTE_FLAGS(pte) ((uint)(pte) & 0xFFF) //
get last 12 bits
mmu.h : Segmentation related
macros
// various segment selectors.
#define SEG_KCODE 1 // kernel code
#define SEG_KDATA 2 // kernel data+stack
#define SEG_UCODE 3 // user code
#define SEG_UDATA 4 // user data+stack
#define SEG_TSS 5 // this process's task
state
mmu.h : Segmentation related
macros
// various segment selectors.
#define SEG_KCODE 1 // kernel code
#define SEG_KDATA 2 // kernel data+stack
#define SEG_UCODE 3 // user code
#define SEG_UDATA 4 // user data+stack
#define SEG_TSS 5 // this process's task state
#define NSEGS 6
mmu.h : Segmentation related
macros
struct segdesc { // 64 bit in size
uint lim_15_0 : 16; // Low bits of segment limit
uint base_15_0 : 16; // Low bits of segment base address
uint base_23_16 : 8; // Middle bits of segment base address
uint type : 4; // Segment type (see STS_ constants)
uint s : 1; // 0 = system, 1 = application
uint dpl : 2; // Descriptor Privilege Level
uint p : 1; // Present
uint lim_19_16 : 4; // High bits of segment limit
uint avl : 1; // Unused (available for software use)
uint rsv1 : 1; // Reserved
uint db : 1; // 0 = 16-bit segment, 1 = 32-bit segment
uint g : 1; // Granularity: limit scaled by 4K when set
uint base_31_24 : 8; // High bits of segment base address
};
mmu.h : Segmentation related
code
// Application segment type bits
#define STA_X 0x8 // Executable segment
#define STA_W 0x2 // Writeable (non-executable
segments)
#define STA_R 0x2 // Readable (executable
segments)

// System segment type bits

#define STS_T32A 0x9 // Available 32-bit TSS
#define STS_IG32 0xE // 32-bit Interrupt Gate
#define STS_TG32 0xF // 32-bit Trap Gate
Handling Traps
Handling traps

Transition from user mode to kernel mode

On a system call

On a hardware interrupt

User program doing illegal work (exception)

Actions needed, particularly w.r.t. to hardware
interrupts

Change to kernel mode

Kernel to work with devices, if needed

Kernel to understand interface of device
Handling traps

Actions needed on a trap

Save the processor’s registers (context) for future
use

Set up the system to run kernel code (kernel context)

Start kernel in appropriate place (sys call, intr
handler, etc)

Kernel to get all info related to event (which block I/O
done?, which sys call called, which process did
exception and what type, get arguments to system
call, etc)

Privilege level

The x86 has 4 protection levels, numbered 0
(most privilege) to 3 (least privilege).

In practice, most operating systems use
only 2 levels: 0 and 3, which are then called
kernel mode and user mode, respectively.

The current privilege level with which the
x86 executes instructions is stored in %cs
register, in the field CPL.
Privilege level

Changes automatically on
“int” instruction
hardware interrupt
exeception

Changes back on
iret

“int” 10 --> makes 10th hardware interrupt. S/w
interrupt can be used to create hardware interrupt’

Xv6 uses “int 64” for actual system calls
Interrupt Descriptor Table (IDT)

IDT defines intertupt handlers

Has 256 entries

each giving the %cs and %eip to be used when handling the
corresponding interrupt.

Interrupts 0-31 are defined for software exceptions,
like divide errors or attempts to access invalid
memory addresses.

Xv6 maps the 32 hardware interrupts to the range 32-
63

and uses interrupt 64 as the system call interrupt
Interrupt Descriptor Table (IDT)
entries
// Gate descriptors for interrupts and traps
struct gatedesc {
uint off_15_0 : 16; // low 16 bits of offset in
segment
uint cs : 16; // code segment selector
uint args : 5; // # args, 0 for interrupt/trap
gates
uint rsv1 : 3; // reserved(should be zero I
guess)
uint type : 4; // type(STS_{IG32,TG32})
uint s : 1; // must be 0 (system)
uint dpl : 2; // descriptor(meaning new)
privilege level
uint p : 1; // Present
uint off_31_16 : 16; // high bits of offset in segment
};
Setting IDT entries
void
tvinit(void)
{
int i;
for(i = 0; i < 256; i++)
SETGATE(idt[i], 0, SEG_KCODE<<3, vectors[i], 0);
SETGATE(idt[T_SYSCALL], 1, SEG_KCODE<<3,
vectors[T_SYSCALL], DPL_USER);
/* value 1 in second argument --> don't disable
interrupts
* DPL_USER means that processes can raise
this interrupt. */
initlock(&tickslock, "time");
}
Setting IDT entries
#define SETGATE(gate, istrap, sel, off, d) \
{ \
(gate).off_15_0 = (uint)(off) & 0xffff; \
(gate).cs = (sel); \
(gate).args = 0; \
(gate).rsv1 = 0; \
(gate).type = (istrap) ? STS_TG32 : STS_IG32; \
(gate).s = 0; \
(gate).dpl = (d); \
(gate).p = 1; \
(gate).off_31_16 = (uint)(off) >> 16; \
}
Setting IDT entries
Vectors.S trapasm.S
# generated by vectors.pl - #include "mmu.h"
do not edit
# vectors.S sends all traps
# handlers here.
.globl alltraps .globl alltraps
.globl vector0 alltraps:
vector0:
# Build trap frame.
pushl $0
pushl %ds
pushl $0
pushl %es
jmp alltraps
pushl %fs
.globl vector1
pushl %gs
vector1:
pushl $0 Pushal
pushl $1 ....
jmp alltraps
How will interrupts be handled?
On int instruction/interrupt
the CPU does this:

Fetch the n’th descriptor from 
Push %ss. // optional
the IDT, where n is the
argument of int. 
Push %esp. // optional

Check that CPL in %cs is <= 
Push %eflags.
DPL, where DPL is the privilege
level in the descriptor. 
Push %cs.

Save %esp and %ss in CPU- 
Push %eip.
internal registers, but only if the
target segment selector’s PL < 
Clear the IF bit in %eflags,
CPL. but only on an interrupt.

Switching from user mode to
kernel mode. Hence save user 
Set %cs and %eip to the
code’s SS and ESP
values in the descriptor.

Load %ss and %esp from a task
segment descriptor.

Stack changes to kernel stack
now
After “int” ‘s job is done

IDT was already set

Remember vectors.S

So jump to 64th entry in vector’s
vector64:
pushl $0
pushl $64
jmp alltraps

So now stack has ss, esp,eflags, cs, eip, 0 (for error code),
64

Next run alltraps from trapasm.S
# Build trap frame.
pushl %ds alltraps:
pushl %es
pushl %fs 
Now stack contains
pushl %gs 
ss, esp,eflags, cs, eip, 0
pushal // push all gen purpose (for error code), 64, ds,
regs es, fs, gs, eax, ecx, edx,
# Set up data segments. ebx, oesp, ebp, esi, edi
movw $(SEG_KDATA<<3), %ax 
This is the struct
trapframe !
movw %ax, %ds

So the kernel stack now
movw %ax, %es contains the trapframe
# Call trap(tf), where tf=%esp 
Trapframe is a part of
pushl %esp # first arg to trap() kernel stcak
call trap
addl $4, %esp
void
trap(struct trapframe *tf) trap()
{
if(tf->trapno == T_SYSCALL){ 
Argument is trapframe
if(myproc()->killed) 
In alltraps
exit(); 
Before “call trap”, there
myproc()->tf = tf; was “push %esp” and
syscall(); stack had the trapframe
if(myproc()->killed)

Remember calling
convention --> when a
exit(); function is called, the
return; stack contains the
} arguments in reverse
order (here only 1 arg)
switch(tf->trapno){
.....
trap()
Timer
Has a switch



wakeup(&ticks)

switch(tf->trapno) 
IDE: disk interrupt

Ideintr()

Q: who set this 
KBD
trapno? 
Kbdintr()
COM1
Depending on the



Uatrintr()
type of trap 
If Timer
Call yield() -- calls sched()
Call interrupt



If process was killed (how is that
handler done?

Call exit()!

Stack had (trapframe)
ss, esp,eflags, cs, eip, 0 (for
when trap() returns


error code), 64, ds, es, fs, gs,

eax, ecx, edx, ebx, oesp, ebp,
esi, edi, esp

#Back in alltraps
call trap 
add $4 %esp
addl $4, %esp 
esp

# Return falls through to trapret...


popal
.globl trapret 
eax, ecx, edx, ebx, oesp, ebp,
trapret: esi, edi
popal 
Then gs, fs, es, ds
popl %gs
popl %fs 
add $0x8, %esp
popl %es 
0 (for error code), 64
popl %ds
addl $0x8, %esp # trapno and errcode

iret
iret 
ss, esp,eflags, cs, eip,

Extra slides
To be Ignored
bootasm.S after “lgdt gdtdesc”
till jump to “entry”

Logical
Address =
offset Physical
Address

CS, SS, etc.

Selector Base Limit Permissions

3
2 0 4GB Write
1 0 4GB Read, Execute
DS 0 0 0 0
GDT
SS

CS
From entry: RAM
Till: inside main(), before kvmalloc()

Logical
Address =
offset Linear
Address

4MB
CS, SS, etc. 0
Selector Base Limit Permissions

3
2 0 4GB Write 512 0 P,W,PS
1 0 4GB Read, Execute .
DS 0 0 0 0 .
GDT .
SS
3
2
CS
1
CR3 0 0 P,W,PS
entrypgdir
From entry: RAM
Till: inside main(), before kvmalloc()
Physical
Addr
Logical
Address =
offset Dir Offset

4MB
CS, SS, etc. 0
Selector Base Limit Permissions

3
2 0 4GB Write 512 0 P,W,PS
1 0 4GB Read, Execute .
DS 0 0 0 0 .
GDT .
SS
3
2
CS
1
Even now, every
CR3 0 0 P,W,PS
Logical address =
Physical address, but entrypgdir
through Page dir
Free List in XV6 Obtained after main() -> kinit1()

Pages obtained Between

end = 801154a8 = 2049 MB to P2V(4MB) = 2052 MB
Remember
Right now Logical = Physical address.

lock
kmem
uselock Seen
Actually like
run *freelist independent
this in memory
ly

run * run * run *

kmap[] mappings done in kvmalloc(). This shows segmentwise,
entries are done in page directory and page table for corresponding
VA-> PA mappings
4GB

DEVSPACE=3.96GB
4GB
Un
mapped DEVSPACE=3.96GB
KERNBASE+PHYSTOP=
2.224GB= 2272MB Unused
Kernel
data +
memory PHYSTOP = 224MB
data= 2049.0.3125 MB Kernel
Kernel data +
code + memory
RO Data
KERNBASE+EXTMEM=2049
Kernel 1.03125 MB = 1MB +
MB I/O
code + data
Space
RO Data
KERNBASE=2048MB
I/O EXTMEM=1MB
Process Space
address
space 0x80108000 =data
0
= 2049.3125 MB
0 Is obtained from
kernel.sym
After kvmalloc() in main() RAM
Physical
Addr
Logical Linear
Address = Address
offset Dir pg Offset

4MB

CS, SS, etc. 0

Selector Base Limit Permissions

3
2 0 4GB Write
1 0 4GB Read, Execute
DS 0 0 0 0
GDT
SS
Page
Table
CS
Now Linear Address =
CR3
Logical Address !=
Physical Address kpgdir
After seginit() in main(). RAM
On the processor where we started booting Physical
Addr
Logical Linear
Address = Address
offset Dir pg Offset

4MB

CS, SS, etc. 0

Selector Base Limit Permissions DPL
4 0 4GB Write 3
3 0 4GB Read, Execute 3
2 0 4GB Write 0
1 0 4GB Read, Execute 0
DS 0 0 0 0
GDT
SS
Page
Table
CS
Now Linear Address =
CR3
Logical Address !=
Physical Address kpgdir
Segmentation + Paging

COAL Lab#05
100% (1)
COAL Lab#05
23 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Sap Variant Configuration
100% (3)
Sap Variant Configuration
13 pages
Autonomic Computing - Principles, Design and Implementation
50% (2)
Autonomic Computing - Principles, Design and Implementation
298 pages
Lab 4: Introduction To x86 Assembly
No ratings yet
Lab 4: Introduction To x86 Assembly
14 pages
Theory Viet.
No ratings yet
Theory Viet.
156 pages
A Crash Course On x86 Disassembly
No ratings yet
A Crash Course On x86 Disassembly
23 pages
lecture01-intro
No ratings yet
lecture01-intro
67 pages
Os Notes Mit
No ratings yet
Os Notes Mit
9 pages
x86_assembly_reversing_cheat_sheet
No ratings yet
x86_assembly_reversing_cheat_sheet
1 page
Intel Cheat Sheet
No ratings yet
Intel Cheat Sheet
8 pages
01 Lecture02
No ratings yet
01 Lecture02
78 pages
x86 Instructions - Windows drivers _ Microsoft Learn
No ratings yet
x86 Instructions - Windows drivers _ Microsoft Learn
14 pages
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
No ratings yet
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
25 pages
Lesson 2.1 - Intro + x86-x64 Assembly
No ratings yet
Lesson 2.1 - Intro + x86-x64 Assembly
33 pages
x86 Assembly Language: Debugging Survival Guide (x86-DSG) Divyanand Kutagulla
No ratings yet
x86 Assembly Language: Debugging Survival Guide (x86-DSG) Divyanand Kutagulla
52 pages
x86 Assembly Tutorial: COS 318: Fall 2017
No ratings yet
x86 Assembly Tutorial: COS 318: Fall 2017
23 pages
IA32
No ratings yet
IA32
45 pages
IA32
No ratings yet
IA32
45 pages
Basic Architecture Ia32 x86
No ratings yet
Basic Architecture Ia32 x86
41 pages
Assembly Language
No ratings yet
Assembly Language
26 pages
Modern Computer Architecture and Programming in Assembly Language - TCM - 183 - 1309076
No ratings yet
Modern Computer Architecture and Programming in Assembly Language - TCM - 183 - 1309076
131 pages
ch4 Handouts
No ratings yet
ch4 Handouts
72 pages
Learn CyberTalents
No ratings yet
Learn CyberTalents
16 pages
Osd Co4
No ratings yet
Osd Co4
319 pages
Practical Malware Analysis: CH 4: A Crash Course in x86 Disassembly
No ratings yet
Practical Malware Analysis: CH 4: A Crash Course in x86 Disassembly
50 pages
Lecture 3: MIPS Instruction Set
No ratings yet
Lecture 3: MIPS Instruction Set
24 pages
Assembly - Procedures
No ratings yet
Assembly - Procedures
5 pages
14 Assembly Instructions
No ratings yet
14 Assembly Instructions
9 pages
Assembly Programming
No ratings yet
Assembly Programming
6 pages
Assembly Manual
No ratings yet
Assembly Manual
4 pages
Week 4 Lecture PART 2
No ratings yet
Week 4 Lecture PART 2
37 pages
Code Generation Issues: Compilation 2014
No ratings yet
Code Generation Issues: Compilation 2014
19 pages
Assembly-Simple Instructions Manual
No ratings yet
Assembly-Simple Instructions Manual
9 pages
Week 3 - Lecture
No ratings yet
Week 3 - Lecture
68 pages
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
No ratings yet
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
110 pages
IntroductionToIntelx86 Part1 PDF
No ratings yet
IntroductionToIntelx86 Part1 PDF
113 pages
4
No ratings yet
4
6 pages
EE209A - 24 15 Assembly2
No ratings yet
EE209A - 24 15 Assembly2
45 pages
U1 - 8051 ALP Instructions
No ratings yet
U1 - 8051 ALP Instructions
70 pages
Intro x86 Part 2: More Examples and Analysis: Xeno Kovah - 2009/2010 Xkovah at Gmail
No ratings yet
Intro x86 Part 2: More Examples and Analysis: Xeno Kovah - 2009/2010 Xkovah at Gmail
41 pages
IntroductionToIntelx86 Part2 PDF
No ratings yet
IntroductionToIntelx86 Part2 PDF
41 pages
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
No ratings yet
CS 4740/6740 Network Security: Lecture 7: Memory Corruption (Assembly Review, Basic Exploits)
189 pages
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
No ratings yet
Introduction To Intel x86 Assembly, Architecture, Applications, & Alliteration
113 pages
Modern Compiler Design, Associated Supplemental Materials, Generating x86, David Galle - x86 PDF
No ratings yet
Modern Compiler Design, Associated Supplemental Materials, Generating x86, David Galle - x86 PDF
31 pages
CSE 2421 Study Sheet
No ratings yet
CSE 2421 Study Sheet
3 pages
att0
No ratings yet
att0
13 pages
Rema Lab 2 Mihirpatel k042
No ratings yet
Rema Lab 2 Mihirpatel k042
6 pages
Assembly
No ratings yet
Assembly
27 pages
AL Basics
No ratings yet
AL Basics
27 pages
Intel x86 Instruction Set Architecture: Dr. Nihat Adar
No ratings yet
Intel x86 Instruction Set Architecture: Dr. Nihat Adar
41 pages
chapter2_2
No ratings yet
chapter2_2
25 pages
Assembly - Arithmetic Instructions - Tutorialspoint
No ratings yet
Assembly - Arithmetic Instructions - Tutorialspoint
10 pages
Lecture Slides 04 041 x86 Mov Swap
No ratings yet
Lecture Slides 04 041 x86 Mov Swap
16 pages
EE209A - 24 16 AssemblyFunctions
No ratings yet
EE209A - 24 16 AssemblyFunctions
64 pages
Assembly Language: Function Calls: Goals of This Lecture
No ratings yet
Assembly Language: Function Calls: Goals of This Lecture
21 pages
05 Software Security 2
No ratings yet
05 Software Security 2
33 pages
Do Hoang Tu - Operating System From 0 To 1 (2022) - Removed - Removed - Removed
No ratings yet
Do Hoang Tu - Operating System From 0 To 1 (2022) - Removed - Removed - Removed
21 pages
Roadmap: Java: C
No ratings yet
Roadmap: Java: C
96 pages
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
C Programming
From Everand
C Programming
Netra
No ratings yet
Ext2 File System Assignments and Description Lab
No ratings yet
Ext2 File System Assignments and Description Lab
14 pages
Lecture-1-Introduction To Natural Language Processing-2021
No ratings yet
Lecture-1-Introduction To Natural Language Processing-2021
46 pages
Pilation Linking Loading
No ratings yet
Pilation Linking Loading
3 pages
Threads, Signals: Abhijit A M
No ratings yet
Threads, Signals: Abhijit A M
35 pages
15 Synchronization
No ratings yet
15 Synchronization
120 pages
13 File-Systems
No ratings yet
13 File-Systems
69 pages
10.memory Management 2
No ratings yet
10.memory Management 2
47 pages
8.processes, Ipc
No ratings yet
8.processes, Ipc
78 pages
System Calls: Abhijit A. M
No ratings yet
System Calls: Abhijit A. M
16 pages
5 Xv6-Notes
No ratings yet
5 Xv6-Notes
316 pages
3.hardware and Software Interrupts
No ratings yet
3.hardware and Software Interrupts
23 pages
1.presentation Command Line and Shell Programming
No ratings yet
1.presentation Command Line and Shell Programming
48 pages
B9 (Yingshu Li My T. Thai Weili Wu) Wireless Sensor
No ratings yet
B9 (Yingshu Li My T. Thai Weili Wu) Wireless Sensor
453 pages
Misruda Introduction To Systems Software
No ratings yet
Misruda Introduction To Systems Software
140 pages
Business Benefits of SAP S4HANA
100% (1)
Business Benefits of SAP S4HANA
9 pages
Digital Electronics Timing Diagrams
No ratings yet
Digital Electronics Timing Diagrams
11 pages
Digital Modulation Techniques
No ratings yet
Digital Modulation Techniques
15 pages
Posted Jul 29, 2015 by ,: Jordan Crook Fitz Tepper
No ratings yet
Posted Jul 29, 2015 by ,: Jordan Crook Fitz Tepper
5 pages
CSI DHL Resilience360 Google Case Study en
No ratings yet
CSI DHL Resilience360 Google Case Study en
2 pages
Syntax Analysis
No ratings yet
Syntax Analysis
53 pages
En Itil FND 2011 Samplepaperc v1.3
No ratings yet
En Itil FND 2011 Samplepaperc v1.3
16 pages
Net Neutrality: Protecting The Internet: ©2011 Luke Seelenbinder
0% (1)
Net Neutrality: Protecting The Internet: ©2011 Luke Seelenbinder
8 pages
3.2 Interoperability Testing v1.03 w USB Type-C
No ratings yet
3.2 Interoperability Testing v1.03 w USB Type-C
145 pages
5-Module 4-Cloud Environments - Case study_ One cloud service provider per service model-02-09-2024
No ratings yet
5-Module 4-Cloud Environments - Case study_ One cloud service provider per service model-02-09-2024
98 pages
Logtrack
No ratings yet
Logtrack
52 pages
Derecho Civil General y Personas Arturo Valencia Zea PDF
No ratings yet
Derecho Civil General y Personas Arturo Valencia Zea PDF
3 pages
3 R Markdown
No ratings yet
3 R Markdown
5 pages
Beginning Entity Framework Core 5: From Novice to Professional 1st Edition Eric Vogel pdf download
100% (3)
Beginning Entity Framework Core 5: From Novice to Professional 1st Edition Eric Vogel pdf download
63 pages
Android Manifest
No ratings yet
Android Manifest
12 pages
A Detailed Study On Text Mining Techniques
No ratings yet
A Detailed Study On Text Mining Techniques
4 pages
05 Implement Advanced Search Features in Azure AI Search
No ratings yet
05 Implement Advanced Search Features in Azure AI Search
56 pages
Beamex MC5 Data Logging Option Manual ENG
No ratings yet
Beamex MC5 Data Logging Option Manual ENG
2 pages
Troubleshooting
No ratings yet
Troubleshooting
5 pages
Functional Programming in Python Syllabus
No ratings yet
Functional Programming in Python Syllabus
3 pages
LAb 5 Windows Forensics
No ratings yet
LAb 5 Windows Forensics
7 pages
CP1W Mad11
No ratings yet
CP1W Mad11
14 pages
Hoang Le
No ratings yet
Hoang Le
6 pages
Clarion: A Global Brand That Puts Your Future First
No ratings yet
Clarion: A Global Brand That Puts Your Future First
6 pages
1-Object Oriented Programming Development-University of Luton
100% (1)
1-Object Oriented Programming Development-University of Luton
21 pages
Program For Matrix Keyboard Interfacing: Objective: Requirements: Theory
No ratings yet
Program For Matrix Keyboard Interfacing: Objective: Requirements: Theory
3 pages
Basic Application Software: Mcgraw-Hill
No ratings yet
Basic Application Software: Mcgraw-Hill
34 pages

7.x86 - Architecture

Uploaded by

7.x86 - Architecture

Uploaded by

Basics of X86 architecture

Credits: Notes by Prof. Sorav Bansal, https://www.felixcloutier.com/x86/, Intel Documentation

main() { ... a(); ...} a(){ ... b();... c(); ...}

Each “frame’ contains the local variables for that function

Consider this C code

esp * Control is here */

PG: Paging enabled or not WP: Write protecion on/off

// Address in page table or page directory entry

// System segment type bits

error code), 64, ds, es, fs, gs,

# Return falls through to trapret...

CS, SS, etc.

Pages obtained Between

run * run * run *

CS, SS, etc. 0

CS, SS, etc. 0

You might also like