Double Mersenne number: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>Kirachinmoku
m Reverted edits by 59.161.114.243 (talk) to last revision by Kirachinmoku (HG)
en>David Eppstein
Undid revision 638145976 by 107.72.164.26 (talk) external links should not be used for formulas
 
Line 1: Line 1:
{{more footnotes|date=January 2010}}
I'm a 34 years old, married and work at the college (Film Studies).<br>In my free time I learn Dutch. I've been there and look forward to returning sometime in the future. I love to read, preferably on my ipad. I like to watch Doctor Who  [http://clairvoyantburnley.co.uk/content/starting-college-read-article-bookbyte-promo Bookbyte Coupon 20% Off] and Doctor Who as well as documentaries about anything technological. I like Games Club - Dungeons and   [http://3bbc.com/index.php?do=/profile-4216/info/ Bookbyte Coupon Codes] Dragons, Monopoly, Etc..<br><br>my webpage - [http://demo.hakaveret.org.il/he/node/2962 Bookbyte Bonus Code]
{{lowercase|title=x86 instruction listings}}
The [[x86]] [[instruction set]] has been extended several times, introducing wider registers and datatypes and/or new functionality<ref name=":0">{{cite web
| url = http://www.intel.com/content/www/us/en/processors/processor-identification-cpuid-instruction-note.html?wapkw=processor-identification-cpuid-instruction
| title = Re: Intel® Processor Identification and the CPUID Instruction
| accessdate = 2013-04-21
}}</ref>
 
== x86 integer instructions ==
 
This is the full 8086/8088 instruction set, but most, if not all of these instructions are available in 32-bit mode, they just operate on 32-bit registers (eax, ebx, etc.) and values instead of their 16-bit (ax, bx, etc.) counterparts. See also [[x86 assembly language]] for a quick tutorial for this processor family.
The updated instruction set is also grouped according to architecture (i386, i486, i686) and more generally is referred to as x86_32 and x86_64 (also known as AMD64).
 
=== Original 8086/8088 instructions ===
 
{| class="wikitable sortable"
|+ Original 8086/8088 instruction set
! Instruction !! Meaning !! Notes !! Opcode
|-
| [[Intel BCD opcodes|AAA]] || ASCII adjust AL after addition || used with unpacked [[binary coded decimal]]
Example:
 
<nowiki>;</nowiki>Assume AL=0011 0101,ASCII 5;
 
BL=0011 1001,ASCII  9 
|
|-
| AAD || ASCII adjust AX before division || 8086/8088 datasheet documents only base 10 version of the AAD instruction ([[opcode]] 0xD5 0x0A), but any other base will work. Later Intel's documentation has the generic form too. NEC V20 and V30 (and possibly other NEC V-series CPUs) always use base 10, and ignore the argument, causing a number of incompatibilities  ||
|-
| AAM || ASCII adjust AX after multiplication || rowspan=2 |Only base 10 version is documented, see notes for AAD  ||
|-
| AAS || ASCII adjust AL after subtraction  ||
|-
| ADC || Add with carry || <tt>destination := destination + source + [[Carry flag|carry_flag]]</tt>  ||
|-
| ADD || Add || (1) <tt>r/m += r/imm;</tt> (2) <tt>r += m/imm;</tt> ||
|-
| AND || [[Logical conjunction|Logical AND]] || (1) <tt>r/m &= r/imm;</tt> (2) <tt>r &= m/imm;</tt>  ||
|-
| CALL || Call procedure || {{code|lang=asm|push eip + 2 ;  jmp operand}} ||
|-
| CBW || Convert byte to word ||  ||
|-
| CLC || Clear [[carry flag]] || <tt>CF = 0;</tt>  ||
|-
| CLD || Clear [[direction flag]] || <tt>DF = 0;</tt>  ||
|-
| [[CLI (x86 instruction)|CLI]] || Clear [[IF (x86 flag)|interrupt flag]] || <tt>IF = 0;</tt>  ||
|-
| CMC || Complement carry flag ||  ||
|-
| CMP || Compare operands ||  ||
|-
| CMPSB || Compare bytes in memory ||  ||
|-
| CMPSW || Compare words ||  ||
|-
| CWD || Convert word to doubleword ||    ||
|-
| [[Intel BCD opcodes|DAA]] || Decimal adjust AL after addition || (used with packed [[binary coded decimal]])  ||
|-
| [[Intel BCD opcodes|DAS]] || Decimal adjust AL after subtraction ||  ||
|-
| DEC || Decrement by 1 ||  ||
|-
| DIV || Unsigned divide || <tt>DX:AX = DX:AX / r/m;</tt> resulting DX = remainder ||
|-
| ESC || Used with [[floating-point unit]] ||  ||
|-
| [[HLT]] || Enter halt state ||  || 0xF4
|-
| IDIV || Signed divide || <tt>DX:AX = DX:AX / r/m;</tt> resulting DX = remainder ||
|-
| IMUL || Signed multiply || (1) <tt>DX:AX = AX * r/m;</tt> (2) AX = AL * r/m ||
|-
| IN || Input from port || (1) <tt>AL = port[imm];</tt> (2) <tt>AL = port[DX];</tt> (3) <tt>AX = port[DX];</tt> ||
|-
| INC || Increment by 1 ||  ||
|-
| [[INT (x86 instruction)|INT]] || Call to [[interrupt]] ||  ||
|-
| INTO || Call to interrupt if overflow ||  ||
|-
| IRET || Return from interrupt ||    ||
|-
| Jcc || [[Branch (computer science)|Jump if condition]] || (''JA, JAE, JB, JBE, JC, JE, JG, JGE, JL, JLE, JNA, JNAE, JNB, JNBE, JNC, JNE, JNG, JNGE, JNL, JNLE, JNO, JNP, JNS, JNZ, JO, JP, JPE, JPO, JS, JZ'')  ||
|-
| JCXZ || Jump if CX is zero ||  ||
|-
| [[JMP (x86 instruction)|JMP]] || Jump ||    ||
|-
| LAHF || Load flags into AH register ||    ||
|-
| LDS || Load pointer using DS ||  ||
|-
| LEA || [[Load Effective Address]] ||  ||
|-
| LES || Load ES with pointer ||  ||
|-
| LOCK || Assert BUS LOCK# signal || (for multiprocessing)  ||
|-
| LODSB || Load string byte || {{code|lang=c|1=if (DF==0) AL = *SI++; else AL = *SI--;}}  ||
|-
| LODSW || Load string word || {{code|lang=c|1=if (DF==0) AX = *SI++; else AX = *SI--;}}  ||
|-
| LOOP/LOOPx || Loop control || (''LOOPE, LOOPNE, LOOPNZ, LOOPZ'') {{code|lang=c|1=if (x && --CX) goto lbl;}}  ||
|-
| [[MOV (x86 instruction)|MOV]] || Move|| copies data from one location to another, (1) <tt>r/m = r;</tt> (2) <tt>r = r/m;</tt>  ||
|-
| MOVSB || Move byte from string to string || {{code|lang=c|1=if (DF==0) *(byte*)DI++ = *(byte*)SI++; else *(byte*)DI-- = *(byte*)SI--;}} ||
|-
| MOVSW || Move word from string to string || {{code|lang=c|1=if (DF==0) *(word*)DI++ = *(word*)SI++; else *(word*)DI-- = *(word*)SI--;}} ||
|-
| MUL || Unsigned multiply || (1) <tt>DX:AX = AX * r/m;</tt> (2) <tt>AX = AL * r/m;</tt> ||
|-
| NEG || Two's complement negation || {{code|lang=c|1=r/m *= -1;}} ||
|-
| [[NOP]] || No operation || opcode equivalent to XCHG EAX, EAX || 0x90
|-
| NOT || Negate the operand, [[Bitwise_operation#NOT|logical NOT]] || <tt>r/m ^= -1;</tt>  ||
|-
| OR || [[Logical disjunction|Logical OR]] || (1) <source lang="c" enclose="none">r/m |= r/imm;</source> (2) <source lang="c" enclose="none">r |= m/imm;</source> ||
|-
| OUT || Output to port || (1) <tt>port[imm] = AL;</tt> (2) <tt>port[DX] = AL;</tt> (3) <tt>port[DX] = AX;</tt>  ||
|-
| POP || Pop data from [[Stack (data structure)|stack]] || <tt>*SP++ = r/m;</tt> POP CS (opcode 0x0F) works only on 8086/8088. Later CPUs use 0x0F as a prefix for newer instructions. || 0x0F
|-
| POPF || Pop data from [[FLAGS register (computing)|flags register]] || <tt>*SP++ = flags;</tt>  ||
|-
| PUSH || Push data onto stack || {{code|lang=c|1=*--SP = r/m;}} ||
|-
| PUSHF || Push flags onto stack || {{code|lang=c|1=*--SP = flags;}} ||
|-
| RCL || Rotate left (with carry) ||    ||
|-
| RCR || Rotate right (with carry) ||  ||
|-
| REPxx || Repeat MOVS/STOS/CMPS/LODS/SCAS || (''REP, REPE, REPNE, REPNZ, REPZ'')  ||
|-
| RET || Return from procedure || Not a real instruction. The assembler will translate these to a RETN or a RETF depending on the memory model of the target system. ||
|-
| RETN || Return from near procedure ||  ||
|-
| RETF || Return from far procedure ||  ||
|-
| ROL || Rotate left ||  ||
|-
| ROR || Rotate right ||  ||
|-
| SAHF || Store AH into flags ||  ||
|-
| SAL || [[Arithmetic shift|Shift Arithmetically]] left (signed shift left) || (1) <tt>r/m <<= 1;</tt> (2) <tt>r/m <<= CL;</tt> ||
|-
| SAR || Shift Arithmetically right (signed shift right) || (1) (signed)<tt>r/m >>= 1;</tt> (2) (signed)<tt>r/m >>= CL;</tt> ||
|-
| SBB || Subtraction with borrow || alternative 1-byte encoding of SBB&nbsp;AL,&nbsp;AL is available via [[#Undocumented instructions|undocumented]] SALC instruction  ||
|-
| SCASB || Compare byte string ||  ||
|-
| SCASW || Compare word string ||  ||
|-
| SHL || [[Logical shift|Shift]] left (unsigned shift left) ||  ||
|-
| SHR || Shift right (unsigned shift right) ||  ||
|-
| STC || Set carry flag || <tt>CF = 1;</tt>  ||
|-
| STD || Set direction flag || <tt>DF = 1;</tt>  ||
|-
| [[STI (x86 instruction)|STI]] || Set interrupt flag || {{code|lang=c|1=IF = 1;}} ||
|-
| STOSB || Store byte in string || {{code|lang=c|1=if (DF==0) *ES:DI++ = AL; else *ES:DI-- = AL;}} ||
|-
| STOSW || Store word in string || {{code|lang=c|1=if (DF==0) *ES:DI++ = AX; else *ES:DI-- = AX;}} ||
|-
| SUB || Subtraction || (1) <tt>r/m -= r/imm;</tt> (2) <tt>r -= m/imm;</tt> ||
|-
| [[TEST (x86 instruction)|TEST]] || Logical compare (AND) || (1) <tt>r/m & r/imm;</tt> (2) <tt>r & m/imm;</tt> ||
|-
| WAIT || Wait until not busy || Waits until BUSY# pin is inactive (used with [[floating-point unit]])  ||
|-
| XCHG || Exchange data || {{code|lang=c|1=r :=: r/m;}} ||
|-
| XLAT || Table look-up translation || behaves like <code>MOV AL, [BX+AL]</code>  ||
|-
| XOR || [[Exclusive or|Exclusive OR]] || (1) <tt>r/m ^= r/imm;</tt> (2) <tt>r ^= m/imm;</tt> ||
|}
 
===Added in specific processors===
 
 
====Added with [[Intel 80186|80186]]/[[Intel 80188|80188]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| BOUND || Check array index against bounds || raises software interrupt 5 if test fails
|-
| ENTER || Enter stack frame || equivalent to <source lang="asm">
Push (E)BP
Set a temporary value (E)FRAME_PTR:=(E)SP
If LEVEL > 0 then
  Repeat (LEVEL - 1) times:
      (E)BP:=(E)BP - 2
      Push the word pointed to by (E)BP
  End repeat
  Push (E)FRAME_PTR
End if
(E)BP:=(E)FRAME_PTR
(E)SP:=(E)SP - first operand.</source>
|-
| INS || Input from port to string || equivalent to
<source lang="asm">
IN (E)AX, DX
MOV ES:[(E)DI], (E)AX
; adjust (E)DI according to operand size and DF
</source>
|-
| LEAVE || Leave stack frame || equivalent to <source lang="asm">MOV SP, BP
POP BP</source>
|-
| OUTS || Output string to port || equivalent to
<source lang="asm">
MOV (E)AX, DS:[(E)SI]
OUT DX, (E)AX
; adjust (E)SI according to operand size and DF
</source>
|-
| POPA || Pop all general purpose registers from stack || equivalent to
<source lang="asm">POP DI
POP SI
POP BP
POP AX ;no POP SP here, only ADD SP,2
POP BX
POP DX
POP CX
POP AX</source>
|-
| PUSHA || Push all general purpose registers onto stack || equivalent to
<source lang="asm">PUSH AX
PUSH CX
PUSH DX
PUSH BX
PUSH SP
PUSH BP
PUSH SI
PUSH DI</source>
|}
 
====Added with [[80286]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| ARPL || Adjust RPL field of selector ||
|-
| CLTS || Clear task-switched flag in register CR0 ||
|-
| LAR || Load access rights byte ||
|-
| LGDT || Load global descriptor table ||
|-
| LIDT || Load interrupt descriptor table ||
|-
| LLDT || Load local descriptor table ||
|-
|-
| LMSW || Load machine status word ||
|-
| [[LOADALL]] || Load all CPU registers, including internal ones such as GDT || Undocumented, 80286 and 80386 only
|-
| LSL || Load segment limit ||
|-
| [[Load Task Register|LTR]] || Load task register ||
|-
| SGDT || Store global descriptor table ||
|-
| SIDT || Store interrupt descriptor table ||
|-
| SLDT || Store local descriptor table ||
|-
| SMSW || Store machine status word ||
|-
| STR || Store task register ||
|-
| VERR || Verify a segment for reading ||
|-
| VERW || Verify a segment for writing ||
|}
 
====Added with [[80386]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| BSF || Bit scan forward ||
|-
| BSR || Bit scan reverse ||
|-
| [[Bit Test|BT]] || Bit test ||
|-
| [[Bit Test|BTC]] || Bit test and complement ||
|-
| [[Bit Test|BTR]] || Bit test and reset ||
|-
| [[Bit Test|BTS]] || Bit test and set ||
|-
| CDQ || Convert double-word to quad-word || Sign-extends EAX into EDX, forming the quad-word EDX:EAX. Since (I)DIV uses EDX:EAX as its input, CDQ must be called after setting EAX if EDX is not manually initialized (as in 64/32 division) before (I)DIV.
|-
| CMPSD || Compare string double-word || Compares ES:[(E)DI] with DS:[SI]
|-
| CWDE || Convert word to double-word || Unlike CWD, CWDE sign-extends AX to EAX instead of AX to DX:AX
|-
| INSD || Input from port to string double-word ||
|-
| IRETx || Interrupt return; D suffix means 32-bit return, F suffix means do not generate epilogue code (i.e. LEAVE instruction) || Use IRETD rather than IRET in 32-bit situations
|-
| JECXZ || Jump if ECX is zero ||
|-
| LFS, LGS || Load far pointer ||
|-
| LSS || Load stack segment ||
|-
| LODSD || Load string double-word || can be prefixed with REP
|-
| LOOPW, LOOP''cc''W || Loop, conditional loop || Same as LOOP, LOOP''cc'' for earlier processors
|-
| LOOPD, LOOPccD || Loop while equal || <code>if (''cc'' && --ECX) goto lbl;</code>, ''cc'' = '''Z'''(ero), '''E'''(qual), '''N'''on'''Z'''ero, '''N'''(on)'''E'''(qual)
|-
| MOVSD || Move string double-word || <code>*(dword*)ES:EDI±± = (dword*)ESI±±;</code> (±± depends on DF)
|-
| MOVSX || Move with sign-extension || <code>(long)r = (signed char) r/m;</code> and similar
|-
| MOVZX || Move with zero-extension || <code>(long)r = (unsigned char) r/m;</code> and similar
|-
| OUTSD || Output to port from string double-word || <code> port[DX] = *(long*)ESI±±;</code> (±± depends on DF)
|-
| POPAD || Pop all double-word (32-bit) registers from stack || Does not pop register ESP off of stack
|-
| POPFD || Pop data into EFLAGS register ||
|-
| PUSHAD || Push all double-word (32-bit) registers onto stack ||
|-
| PUSHFD || Push EFLAGS register onto stack ||
|-
| SCASD || Scan string data double-word ||
|-
| SETcc || Set byte to one on condition, zero otherwise || ''(SETA, SETAE, SETB, SETBE, SETC, SETE, SETG, SETGE, SETL, SETLE, SETNA, SETNAE, SETNB, SETNBE, SETNC, SETNE, SETNG, SETNGE, SETNL, SETNLE, SETNO, SETNP, SETNS, SETNZ, SETO, SETP, SETPE, SETPO, SETS, SETZ)''
|-
| SHLD || Shift left double-word ||
|-
| SHRD || Shift right double-word || <code>r1 = r1>>CL ∣ r2<<(32-CL);</code> Instead of CL, immediate 1 can be used
|-
| STOSD || Store string double-word || <code>*ES:EDI±± = EAX;</code> (±± depends on DF, ES cannot be overridden)
|}
 
====Added with [[80486]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| BSWAP || Byte Swap || <source lang="c" enclose="none">r = r<<24 | r<<8&0x00FF0000 | r>>8&0x0000FF00 | r>>24;</source> Only works for 32 bit registers
|-
| CMPXCHG || atomic CoMPare and eXCHanGe || See [[Compare-and-swap]]
|-
| INVD || Invalidate Internal Caches || Flush internal caches
|-
| INVLPG || Invalidate [[Translation lookaside buffer|TLB]] Entry || Invalidate TLB Entry for page that contains data specified
|-
| WBINVD || Write Back and Invalidate Cache || Writes back all modified cache lines in the processor's internal cache to main memory and invalidates the internal caches.
|-
| XADD || eXchange and ADD || Exchanges the first operand with the second operand, then loads the sum of the two values into the destination operand.
|}
 
====Added with [[Intel P5|Pentium]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| [[CPUID]] || CPU IDentification || Returns data regarding processor identification and features, and returns data to the EAX, EBX, ECX, and EDX registers. Instruction functions specified by the EAX register.<ref name=":0" /> This was also added to later [[80486]] processors
|-
| CMPXCHG8B || CoMPare and eXCHanGe 8 bytes || Compare EDX:EAX with m64. If equal, set ZF and load ECX:EBX into m64. Else, clear ZF and load m64 into EDX:EAX.
|-
| [[RDMSR]] || ReaD from [[Model-specific register]] || Load [[Model-specific register|MSR]] specified by ECX into EDX:EAX
|-
| [[RDTSC]] || ReaD Time Stamp Counter || Returns the number of processor ticks since the processor being "ONLINE" (since the last power on of system)
|-
| [[WRMSR]] || WRite to Model-Specific Register || Write the value in EDX:EAX to [[Model-specific register|MSR]] specified by ECX
|-
| RSM [http://www.softeng.rl.ac.uk/st/archive/SoftEng/SESP/html/SoftwareTools/vtune/users_guide/mergedProjects/analyzer_ec/mergedProjects/reference_olh/mergedProjects/instructions/instruct32_hh/vc279.htm]
|| Resume from System Management Mode
|| This was introduced by the i[[386SL#The_i386SL_variant|386SL]] and later and is also in the i[[Intel 80486SL|486SL]] and later. Resumes from [[System Management Mode]] (SMM)
|}
 
====Added with Pentium MMX====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| [[RDPMC]] || Read the PMC [Performance Monitoring Counter] || Specified in the ECX register into registers EDX:EAX
|}
 
Also MMX registers and MMX support instructions were added. They are usable for both integer and floating point operations, see below.
 
====Added with [[AMD K6]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| SYSCALL ||  ||
|-
| SYSRET ||  || functionally equivalent to SYSENTER and SYSEXIT
|}
 
AMD changed the CPUID detection bit for this feature from the K6-II on.
 
====Added with [[Pentium Pro]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| CMOVcc || Conditional move || ''(CMOVA, CMOVAE, CMOVB, CMOVBE, CMOVC, CMOVE, CMOVG, CMOVGE, CMOVL, CMOVLE, CMOVNA, CMOVNAE, CMOVNB, CMOVNBE, CMOVNC, CMOVNE, CMOVNG, CMOVNGE, CMOVNL, CMOVNLE, CMOVNO, CMOVNP, CMOVNS, CMOVNZ, CMOVO, CMOVP, CMOVPE, CMOVPO, CMOVS, CMOVZ)''
|-
| SYSENTER || SYStem call ENTER ||
|-
| SYSEXIT || SYStem call EXIT ||
|-
| UD2 || Undefined Instruction || Generates an invalid opcode. This instruction is provided for software testing to explicitly generate an invalid opcode. The opcode for this instruction is reserved for this purpose.
|}
 
====Added with [[Streaming SIMD Extensions|SSE]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| MASKMOVQ || Masked Move of Quadword || Selectively write bytes from mm1 to memory location using the byte mask in mm2
|-
| MOVNTPS || Move Aligned Four Packed Single-FP Non Temporal || Move packed single-precision floating-point values from xmm to m128, minimizing pollution in the cache hierarchy.
|-
| MOVNTQ || Move Quadword Non-Temporal ||
|-
| PREFETCH0 || Prefetch Data from Address  || Prefetch into all cache levels
|-
| PREFETCH1 || Prefetch Data from Address  || Prefetch into all cache levels EXCEPT L1
|-
| PREFETCH2 || Prefetch Data from Address  || Prefetch into all cache levels EXCEPT L1 and L2
|-
| PREFETCHNTA || Prefetch Data from Address  || Prefetch into all cache levels to non-temporal cache structure
|-
| SFENCE || Store Fence || Processor hint to make sure all store operations that took place prior to the SFENCE call are globally visible
|}
 
====Added with [[SSE2]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| CLFLUSH || Cache Line Flush  || Invalidates the cache line that contains the linear address specified with the source operand from all levels of the processor cache hierarchy
|-
| LFENCE || Load Fence || Serializes load operations.
|-
| MASKMOVDQU || Masked Move of Double Quadword Unaligned || Stores selected bytes from the source operand (first operand) into a 128-bit memory location
|-
| MFENCE || Memory Fence || Performs a serializing operation on all load and store instructions that were issued prior the MFENCE instruction.
|-
| MOVNTDQ || Move Double Quadword Non-Temporal || Move double quadword from xmm to m128, minimizing pollution in the cache hierarchy.
|-
| MOVNTI || Move Doubleword Non-Temporal || Move doubleword from r32 to m32, minimizing pollution in the cache hierarchy.
|-
| MOVNTPD || Move Packed Double-Precision Floating-Point Values Non-Temporal || Move packed double-precision floating-point values from xmm to m128, minimizing pollution in the cache hierarchy.
 
|-
| PAUSE || Provides a hint to the processor that the following code is a spin loop || for cacheability
|}
 
====Added with [[SSE3]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| LDDQU || Load Unaligned Integer 128 bits || Instructionally equivalent to MOVDQU. Used for video encoding.
|-
| MONITOR EAX, ECX, EDX || Setup Monitor Address || Sets up a linear address range to be monitored by hardware and activates the monitor.
|-
| MWAIT EAX, ECX || Monitor Wait || Processor hint to stop instruction execution and enter an implementation-dependent optimized state until occurrence of a class of events.
|}
 
====Added with [[x86-64]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| CDQE || Sign extend EAX into RAX ||
|-
| CQO || Sign extend RAX into RDX:RAX ||
|-
| CMPSQ || CoMPare String Quadword ||
|-
| CMPXCHG16B || CoMPare and eXCHanGe 16 Bytes ||
|-
| IRETQ || 64-bit Return from Interrupt ||
|-
| JRCXZ || Jump if RCX is zero ||
|-
| LODSQ || LOaD String Quadword ||
|-
| MOVSXD || MOV with Sign Extend 32-bit to 64-bit ||
|-
| POPFQ || POP RFLAGS Register ||
|-
| PUSHFQ || PUSH RFLAGS Register ||
|-
| RDTSCP || ReaD Time Stamp Counter and Processor ID ||
|-
| SCASQ || SCAn String Quadword ||
|-
| STOSQ || STOre String Quadword ||
|-
| SWAPGS || Exchange GS base with KernelGSBase MSR ||
|}
 
====Added with [[x86 virtualization#AMD virtualization (AMD-V)|AMD-V]]====
 
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| CLGI || Clear Global Interrupt Flag || Clears the GIF
|-
| INVLPGA || Invalidate TLB entry in a specified ASID || Invalidates the TLB mapping for the virtual page specified in RAX and the ASID specified in ECX.
|-
| MOV(CRn) || Move to or from control registers || Moves 32- or 64-bit contents to control register and vice versa.
|-
| SKINIT || Secure Init and Jump with Attestation || Verifiable startup of trusted software based on secure hash comparison
|-
| STGI || Set Global Interrupt Flag || Sets the GIF.
|-
| VMLOAD || Load state From VMCB || Loads a subset of processor state from the VMCB specified by the physical address in the RAX register.
|-
| VMMCALL || Call VMM || Used exclusively to communicate with VMM
|-
| VMRUN || Run virtual machine || Performs a switch to the guest OS.
|-
| VMSAVE || Save state To VMCB || Saves additional guest state to VMCB.
|}
 
====Added with [[Intel VT-x]]====
 
VMPTRLD, VMPTRST, VMCLEAR, VMREAD, VMWRITE, VMCALL, VMLAUNCH, VMRESUME, VMXOFF, VMXON
 
====Added with [[SSE4a]]====
 
LZCNT, POPCNT (POPulation CouNT) - advanced bit manipulation
 
==[[x87]] floating-point instructions==
 
===Original [[8087]] instructions===
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| F2XM1 || <math>(2^x) - 1</math> || more precise than <math>2^x</math> for <math>x</math> close to zero
|-
| FABS || Absolute value ||
|-
| FADD || Add ||
|-
| FADDP || Add and pop ||
|-
| FBLD || Load BCD ||
|-
| FBSTP || Store BCD and pop ||
|-
| FCHS || Change sign ||
|-
| FCLEX || Clear exceptions ||
|-
| FCOM || Compare ||
|-
| FCOMP || Compare and pop ||
|-
| FCOMPP || Compare and pop twice ||
|-
| FDECSTP || Decrement floating point stack pointer ||
|-
| FDISI || Disable interrupts || 8087 only, otherwise FNOP
|-
| FDIV || Divide || [[Pentium FDIV bug]]
|-
| FDIVP || Divide and pop ||
|-
| FDIVR || Divide reversed ||
|-
| FDIVRP || Divide reversed and pop ||
|-
| FENI || Enable interrupts || 8087 only, otherwise FNOP
|-
| FFREE || Free register ||
|-
| FIADD || Integer add ||
|-
| FICOM || Integer compare ||
|-
| FICOMP || Integer compare and pop ||
|-
| FIDIV || Integer divide ||
|-
| FIDIVR || Integer divide reversed ||
|-
| FILD || Load integer ||
|-
| FIMUL || Integer multiply ||
|-
| FINCSTP || Increment floating point stack pointer ||
|-
| FINIT || Initialize floating point processor ||
|-
| FIST || Store integer ||
|-
| FISTP || Store integer and pop ||
|-
| FISUB || Integer subtract ||
|-
| FISUBR || Integer subtract reversed ||
|-
| FLD || Floating point load ||
|-
| FLD1 || Load 1.0 onto stack ||
|-
| FLDCW || Load control word ||
|-
| FLDENV || Load environment state ||
|-
| FLDENVW || Load environment state, 16-bit ||
|-
| FLDL2E || Load log<sub>2</sub>(e) onto stack ||
|-
| FLDL2T || Load log<sub>2</sub>(10) onto stack ||
|-
| FLDLG2 || Load log<sub>10</sub>(2) onto stack ||
|-
| FLDLN2 || Load ln(2) onto stack ||
|-
| FLDPI || Load π onto stack ||
|-
| FLDZ || Load 0.0 onto stack ||
|-
| FMUL || Multiply ||
|-
| FMULP || Multiply and pop ||
|-
| FNCLEX || Clear exceptions, no wait ||
|-
| FNDISI || Disable interrupts, no wait || 8087 only, otherwise FNOP
|-
| FNENI || Enable interrupts, no wait || 8087 only, otherwise FNOP
|-
| FNINIT || Initialize floating point processor, no wait ||
|-
| FNOP || No operation ||
|-
| FNSAVE || Save FPU state, no wait, 8-bit ||
|-
| FNSAVEW || Save FPU state, no wait, 16-bit ||
|-
| FNSTCW || Store control word, no wait ||
|-
| FNSTENV || Store FPU environment, no wait ||
|-
| FNSTENVW || Store FPU environment, no wait, 16-bit ||
|-
| FNSTSW || Store status word, no wait ||
|-
| FPATAN || Partial arctangent ||
|-
| FPREM || Partial remainder ||
|-
| FPTAN || Partial tangent ||
|-
| FRNDINT || Round to integer ||
|-
| FRSTOR || Restore saved state ||
|-
| FRSTORW || Restore saved state || Perhaps not actually available in 8087
|-
| FSAVE || Save FPU state ||
|-
| FSAVEW || Save FPU state, 16-bit ||
|-
| FSCALE || Scale by factor of 2 ||
|-
| FSQRT || Square root ||
|-
| FST || Floating point store ||
|-
| FSTCW || Store control word ||
|-
| FSTENV || Store FPU environment ||
|-
| FSTENVW || Store FPU environment, 16-bit ||
|-
| FSTP || Store and pop ||
|-
| FSTSW || Store status word ||
|-
| FSUB || Subtract ||
|-
| FSUBP || Subtract and pop ||
|-
| FSUBR || Reverse subtract ||
|-
| FSUBRP || Reverse subtract and pop ||
|-
| FTST || Test for zero ||
|-
| FWAIT || Wait while FPU is executing ||
|-
| FXAM || Examine condition flags ||
|-
| FXCH || Exchange registers ||
|-
| FXTRACT || Extract exponent and significand ||
|-
| FYL2X || y&nbsp;*&nbsp;log<sub>2</sub>&nbsp;x || if y&nbsp;=&nbsp;log<sub>b</sub>&nbsp;2, then the base-b logarithm is computed
|-
| FYL2XP1 || y&nbsp;*&nbsp;log<sub>2</sub>&nbsp;(x+1) || more precise than log<sub>2</sub>&nbsp;z if x is close to zero
|}
 
===Added in specific processors===
 
====Added with [[x87#80287|80287]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| FSETPM || Set protected mode || 80287 only, otherwise FNOP
|}
 
====Added with [[x87#80387|80387]]====
{| class="wikitable"
! Instruction !! Meaning !! Notes
|-
| FCOS || Cosine ||
|-
| FLDENVD || Load environment state, 32-bit ||
|-
| FSAVED || Save FPU state, 32-bit ||
|-
| FSTENVD || Store FPU environment, 32-bit ||
|-
| FPREM1 || Partial remainder || Computes IEEE remainder
|-
| FRSTORD || Restore saved state, 32-bit ||
|-
| FSIN || Sine ||
|-
| FSINCOS || Sine and cosine ||
|-
| FSTENVD || Store FPU environment, 32-bit ||
|-
| FUCOM || Unordered compare ||
|-
| FUCOMP || Unordered compare and pop ||
|-
| FUCOMPP || Unordered compare and pop twice ||
|}
 
====Added with [[Pentium Pro]]====
 
* [[FCMOV]] variants: FCMOVB, FCMOVBE, FCMOVE, FCMOVNB, FCMOVNBE, FCMOVNE, FCMOVNU, FCMOVU
* [[FCOMI]] variants: FCOMI, FCOMIP, FUCOMI, FUCOMIP
 
====Added with [[Streaming SIMD Extensions|SSE]]====
FXRSTOR, FXSAVE
 
These are also supported on later Pentium IIs which do not contain SSE support
 
====Added with SSE3====
 
FISTTP (x87 to integer conversion with truncation regardless of status word)
 
===Undocumented x87 instructions===
FFREEP performs FFREE ST(i) and pop stack
 
==[[SIMD]] instructions==
 
===[[MMX (instruction set)|MMX]] instructions===
 
====Added with [[Pentium MMX]]====
EMMS, MOVD, MOVQ, PACKSSDW, PACKSSWB, PACKUSWB, PADDB, PADDD, PADDSB, PADDSW, PADDUSB, PADDUSW, PADDW, PAND, PANDN, PCMPEQB, PCMPEQD, PCMPEQW, PCMPGTB, PCMPGTD, PCMPGTW, PMADDWD, PMULHW, PMULLW, POR, PSLLD, PSLLQ, PSLLW, PSRAD, PSRAW, PSRLD, PSRLQ, PSRLW, PSUBB, PSUBD, PSUBSB, PSUBSW, PSUBUSB, PSUBUSW, PSUBW, PUNPCKHBW, PUNPCKHDQ, PUNPCKHWD, PUNPCKLBW, PUNPCKLDQ, PUNPCKLWD, PXOR
 
===[[Extended MMX|MMX+]] instructions===
 
====Added with [[Athlon]]====
Same as the SSE [[SIMD]] integer instructions which operated on [[MMX (instruction set)|MMX]] registers.
 
===[[Extended MMX|EMMX]] instructions===
 
====[[Extended MMX|EMMI]] instructions====
: (added with [[6x86MX]] from [[Cyrix]], deprecated now)
PAVEB, PADDSIW, PMAGW, PDISTIB, PSUBSIW, PMVZB, PMULHRW, PMVNZB, PMVLZB, PMVGEZB, PMULHRIW, PMACHRIW
 
===[[3DNow!]] instructions===
 
====Added with [[K6-2]]====
FEMMS, PAVGUSB, PF2ID, PFACC, PFADD, PFCMPEQ, PFCMPGE, PFCMPGT, PFMAX, PFMIN, PFMUL, PFRCP, PFRCPIT1, PFRCPIT2, PFRSQIT1, PFRSQRT, PFSUB, PFSUBR, PI2FD, PMULHRW, PREFETCH, PREFETCHW
 
===3DNow!+ instructions===
 
====Added with [[Athlon]]====
PF2IW, PFNACC, PFPNACC, PI2FW, PSWAPD
 
====Added with [[Geode GX]]====
PFRSQRTV, PFRCPV
 
===[[Streaming SIMD Extensions|SSE]] instructions===
''Added with [[Pentium III]]''
 
====SSE SIMD floating-point instructions====
ADDPS, ADDSS, CMPPS, CMPSS, COMISS, CVTPI2PS, CVTPS2PI, CVTSI2SS, CVTSS2SI, CVTTPS2PI, CVTTSS2SI, DIVPS, DIVSS, LDMXCSR, MAXPS, MAXSS, MINPS, MINSS, MOVAPS, MOVHLPS, MOVHPS, MOVLHPS, MOVLPS, MOVMSKPS, MOVNTPS, MOVSS, MOVUPS, MULPS, MULSS, RCPPS, RCPSS, RSQRTPS, RSQRTSS, SHUFPS, SQRTPS, SQRTSS, STMXCSR, SUBPS, SUBSS, UCOMISS, UNPCKHPS, UNPCKLPS
 
====SSE SIMD integer instructions====
ANDNPS, ANDPS, ORPS, PAVGB, PAVGW, PEXTRW, PINSRW, PMAXSW, PMAXUB, PMINSW, PMINUB, PMOVMSKB, PMULHUW, PSADBW, PSHUFW, XORPS
 
{| class="wikitable"
! Instruction !! Opcode !! Meaning
|-
|MOVUPS xmm1, xmm2/m128||0F 10 /r||Move Unaligned Packed Single-Precision Floating-Point Values
|-
|MOVSS xmm1, xmm2/m32||F3 0F 10 /r||Move Scalar Single-Precision Floating-Point Values
|-
|MOVUPS xmm2/m128, xmm1||0F 11 /r||Move Unaligned Packed Single-Precision Floating-Point Values
|-
|MOVSS xmm2/m32, xmm1||F3 0F 11 /r||Move Scalar Single-Precision Floating-Point Values
|-
|MOVLPS xmm, m64||0F 12 /r||Move Low Packed Single-Precision Floating-Point Values
|-
|MOVHLPS xmm1, xmm2||0F 12 /r||Move Packed Single-Precision Floating-Point Values High to Low
|-
|MOVLPS m64, xmm||0F 13 /r||Move Low Packed Single-Precision Floating-Point Values
|-
|UNPCKLPS xmm1, xmm2/m128||0F 14 /r||Unpack and Interleave Low Packed Single-Precision Floating-Point Values
|-
|UNPCKHPS xmm1, xmm2/m128||0F 15 /r||Unpack and Interleave High Packed Single-Precision Floating-Point Values
|-
|MOVHPS xmm, m64||0F 16 /r||Move High Packed Single-Precision Floating-Point Values
|-
|MOVLHPS xmm1, xmm2||0F 16 /r||Move Packed Single-Precision Floating-Point Values Low to High
|-
|MOVHPS m64, xmm||0F 17 /r||Move High Packed Single-Precision Floating-Point Values
|-
|PREFETCHNTA||0F 18 /0||Prefetch Data Into Caches (non-temporal data with respect to all cache levels)
|-
|PREFETCH0||0F 18 /1||Prefetch Data Into Caches (temporal data)
|-
|PREFETCH1||0F 18 /2||Prefetch Data Into Caches (temporal data with respect to first level cache)
|-
|PREFETCH2||0F 18 /3||Prefetch Data Into Caches (temporal data with respect to second level cache)
|-
|NOP||0F 1F /0||No Operation
|-
|MOVAPS xmm1, xmm2/m128||0F 28 /r||Move Aligned Packed Single-Precision Floating-Point Values
|-
|MOVAPS xmm2/m128, xmm1||0F 29 /r||Move Aligned Packed Single-Precision Floating-Point Values
|-
|CVTPI2PS xmm, mm/m64||0F 2A /r||Convert Packed Dword Integers to Packed Single-Precision FP Values
|-
|CVTSI2SS xmm, r/m32||F3 0F 2A /r||Convert Dword Integer to Scalar Single-Precision FP Value
|-
|MOVNTPS m128, xmm||0F 2B /r||Store Packed Single-Precision Floating-Point Values Using Non-Temporal Hint
|-
|CVTTPS2PI mm, xmm/m64||0F 2C /r||Convert with Truncation Packed Single-Precision FP Values to Packed Dword Integers
|-
|CVTTSS2SI r32, xmm/m32||F3 0F 2C /r||Convert with Truncation Scalar Single-Precision FP Value to Dword Integer
|-
|CVTPS2PI mm, xmm/m64||0F 2D /r||Convert Packed Single-Precision FP Values to Packed Dword Integers
|-
|CVTSS2SI r32, xmm/m32||F3 0F 2D /r||Convert Scalar Single-Precision FP Value to Dword Integer
|-
|UCOMISS xmm1, xmm2/m32||0F 2E /r||Unordered Compare Scalar Single-Precision Floating-Point Values and Set EFLAGS
|-
|COMISS xmm1, xmm2/m32||0F 2F /r||Compare Scalar Ordered Single-Precision Floating-Point Values and Set EFLAGS
|-
|SQRTPS xmm1, xmm2/m128||0F 51 /r||Compute Square Roots of Packed Single-Precision Floating-Point Values
|-
|SQRTSS xmm1, xmm2/m32||F3 0F 51 /r||Compute Square Root of Scalar Single-Precision Floating-Point Value
|-
|RSQRTPS xmm1, xmm2/m128||0F 52 /r||Compute Reciprocal of Square Root of Packed Single-Precision Floating-Point Value
|-
|RSQRTSS xmm1, xmm2/m32||F3 0F 52 /r||Compute Reciprocal of Square Root of Scalar Single-Precision Floating-Point Value
|-
|RCPPS xmm1, xmm2/m128||0F 53 /r||Compute Reciprocal of Packed Single-Precision Floating-Point Values
|-   
|RCPSS xmm1, xmm2/m32||F3 0F 53 /r||Compute Reciprocal of Scalar Single-Precision Floating-Point Values
|-
|ANDPS xmm1, xmm2/m128||0F 54 /r||Bitwise Logical AND of Packed Single-Precision Floating-Point Values
|-
|ANDNPS xmm1, xmm2/m128||0F 55 /r||Bitwise Logical AND NOT of Packed Single-Precision Floating-Point Values
|-
|ORPS xmm1, xmm2/m128||0F 56 /r||Bitwise Logical OR of Single-Precision Floating-Point Values
|-
|XORPS xmm1, xmm2/m128||0F 57 /r||Bitwise Logical XOR for Single-Precision Floating-Point Values
|-
|ADDPS xmm1, xmm2/m128||0F 58 /r||Add Packed Single-Precision Floating-Point Values
|-
|ADDSS xmm1, xmm2/m32||F3 0F 58 /r||Add Scalar Single-Precision Floating-Point Values
|-
|MULPS xmm1, xmm2/m128||0F 59 /r||Multiply Packed Single-Precision Floating-Point Values
|-
|MULSS xmm1, xmm2/m32||F3 0F 59 /r||Multiply Scalar Single-Precision Floating-Point Values
|-
|SUBPS xmm1, xmm2/m128||0F 5C /r||Subtract Packed Single-Precision Floating-Point Values
|-
|SUBSS xmm1, xmm2/m32||F3 0F 5C /r||Subtract Scalar Single-Precision Floating-Point Values
|-
|MINPS xmm1, xmm2/m128||0F 5D /r||Return Minimum Packed Single-Precision Floating-Point Values
|-
|MINSS xmm1, xmm2/m32||F3 0F 5D /r||Return Minimum Scalar Single-Precision Floating-Point Values
|-
|DIVPS xmm1, xmm2/m128||0F 5E /r||Divide Packed Single-Precision Floating-Point Values
|-
|DIVSS xmm1, xmm2/m32||F3 0F 5E /r||Divide Scalar Single-Precision Floating-Point Values
|-
|MAXPS xmm1, xmm2/m128||0F 5F /r||Return Maximum Packed Single-Precision Floating-Point Values
|-
|MAXSS xmm1, xmm2/m32||F3 0F 5F /r||Return Maximum Scalar Single-Precision Floating-Point Values
|-
|PSHUFW mm1, mm2/m64, imm8||0F 70 /r ib||Shuffle Packed Words
|-
|LDMXCSR m32||0F AE /2||Load MXCSR Register State
|-
|STMXCSR m32||0F AE /3||Store MXCSR Register State
|-
|SFENCE||0F AE /7||Store Fence
|-
|CMPPS xmm1, xmm2/m128, imm8||0F C2 /r ib||Compare Packed Single-Precision Floating-Point Values
|-
|CMPSS xmm1, xmm2/m32, imm8||F3 0F C2 /r ib||Compare Scalar Single-Precision Floating-Point Values
|-
|PINSRW mm, r32/m16, imm8||0F C4 /r||Insert Word
|-
|PEXTRW r32, mm, imm8||0F C5 /r||Extract Word
|-
|SHUFPS xmm1, xmm2/m128, imm8||0F C6 /r ib||Shuffle Packed Single-Precision Floating-Point Values
|-
|PMOVMSKB r32, mm||0F D7 /r||Move Byte Mask
|-
|PMINUB mm1, mm2/m64||0F DA /r||Minimum of Packed Unsigned Byte Integers
|-
|PMAXUB mm1, mm2/m64||0F DE /r||Maximum of Packed Unsigned Byte Integers
|-
|PAVGB mm1, mm2/m64||0F E0 /r||Average Packed Integers
|-
|PAVGW mm1, mm2/m64||0F E3 /r||Average Packed Integers
|-
|PMULHUW mm1, mm2/m64||0F E4 /r||Multiply Packed Unsigned Integers and Store High Result
|-
|MOVNTQ m64, mm||0F E7 /r||Store of Quadword Using Non-Temporal Hint
|-
|PMINSW mm1, mm2/m64||0F EA /r||Minimum of Packed Signed Word Integers
|-
|PMAXSW mm1, mm2/m64||0F EE /r||Maximum of Packed Signed Word Integers
|-
|PSADBW mm1, mm2/m64||0F F6 /r||Compute Sum of Absolute Differences
|-
|MASKMOVQ mm1, mm2||0F F7 /r||Store Selected Bytes of Quadword
|}
 
===[[SSE2]] instructions===
''Added with [[Pentium 4]]''
''Also see integer instructions added with Pentium 4''
 
====SSE2 SIMD floating-point instructions====
{| class="wikitable"
! Instruction !! Opcode !! Meaning
|-
|ADDPD xmm1, xmm2/m128||66 0F 58 /r||Add Packed Double-Precision Floating-Point Values
|-
|ADDSD xmm1, xmm2/m64||F2 0F 58 /r||Add Low Double-Precision Floating-Point Value
|-
|ANDNPD xmm1, xmm2/m128||66 0F 55 /r||Bitwise Logical AND NOT
|-
|CMPPD xmm1, xmm2/m128, imm8||66 0F C2 /r ib||Compare Packed Double-Precision Floating-Point Values
|-
|CMPSD xmm1, xmm2/m64, imm8||F2 0F C2 /r ib||Compare Low Double-Precision Floating-Point Values
|}
ADDPD, ADDSD, ANDNPD, ANDPD, CMPPD, CMPSD*, COMISD, CVTDQ2PD, CVTDQ2PS, CVTPD2DQ, CVTPD2PI, CVTPD2PS, CVTPI2PD, CVTPS2DQ, CVTPS2PD, CVTSD2SI, CVTSD2SS, CVTSI2SD, CVTSS2SD, CVTTPD2DQ, CVTTPD2PI, CVTTPS2DQ, CVTTSD2SI, DIVPD, DIVSD, MAXPD, MAXSD, MINPD, MINSD, [[MOVAPD]], [[MOVHPD]], MOVLPD, MOVMSKPD, MOVSD*, MOVUPD, MULPD, MULSD, ORPD, SHUFPD, SQRTPD, SQRTSD, SUBPD, SUBSD, UCOMISD, UNPCKHPD, UNPCKLPD, XORPD
 
* CMPSD ''and'' MOVSD ''have the same name as the [[string (computer science)|string]] instruction mnemonics'' CMPSD (CMPS) ''and'' MOVSD (MOVS)''; however, the former refer to scalar [[Double precision|double-precision]] [[Floating point|floating-points]] whereas the latters refer to [[Integer (computer science)|doubleword]] strings.''
 
====SSE2 SIMD integer instructions====
MOVDQ2Q, MOVDQA, MOVDQU, MOVQ2DQ, PADDQ, PSUBQ, PMULUDQ, PSHUFHW, PSHUFLW, PSHUFD, PSLLDQ, PSRLDQ, PUNPCKHQDQ, PUNPCKLQDQ
 
===[[SSE3]] instructions===
''Added with Pentium 4 supporting SSE3''
''Also see integer and floating-point instructions added with Pentium 4 SSE3''
 
====SSE3 SIMD floating-point instructions====
*ADDSUBPD, ADDSUBPS (for Complex Arithmetic)
*HADDPD, HADDPS, HSUBPD, HSUBPS (for Graphics)
*[[MOVDDUP]], MOVSHDUP, MOVSLDUP (for Complex Arithmetic)
 
===[[SSSE3]] instructions===
''Added with [[Xeon]] 5100 series and initial [[Core 2]]''
*PSIGNW, PSIGND, PSIGNB
*PSHUFB
*PMULHRSW, PMADDUBSW
*PHSUBW, PHSUBSW, PHSUBD
*PHADDW, PHADDSW, PHADDD
*PALIGNR
*PABSW, PABSD, PABSB
 
===[[SSE4]] instructions===
 
====[[SSE4.1]]====
''Added with [[Core 2]] manufactured in [[45nm]]''
*MPSADBW
*PHMINPOSUW
*PMULLD, PMULDQ
*DPPS, DPPD
*BLENDPS, BLENDPD, BLENDVPS, BLENDVPD, PBLENDVB, PBLENDW
*PMINSB, PMAXSB, PMINUW, PMAXUW, PMINUD, PMAXUD, PMINSD, PMAXSD
*ROUNDPS, ROUNDSS, ROUNDPD, ROUNDSD
*INSERTPS, PINSRB, PINSRD/PINSRQ, EXTRACTPS, PEXTRB, PEXTRW, PEXTRD/PEXTRQ
*PMOVSXBW, PMOVZXBW, PMOVSXBD, PMOVZXBD, PMOVSXBQ, PMOVZXBQ, PMOVSXWD, PMOVZXWD, PMOVSXWQ, PMOVZXWQ, PMOVSXDQ, PMOVZXDQ
*PTEST
*PCMPEQQ
*PACKUSDW
*MOVNTDQA
 
====[[SSE4a]]====
''Added with [[Phenom (processor)|Phenom]] processors''
 
*LZCNT, POPCNT (POPulation CouNT) - advanced bit manipulation
*EXTRQ/INSERTQ
*MOVNTSD/MOVNTSS
 
====[[SSE4.2]]====
''Added with [[Nehalem (microarchitecture)|Nehalem]] processors''
*CRC32
*PCMPESTRI
*PCMPESTRM
*PCMPISTRI
*PCMPISTRM
*PCMPGTQ
 
===[[Fused multiply–add|FMA]] instructions===
{| class="wikitable"
! Instruction !! Opcode !! Meaning !! Notes
|-
|VFMADDPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 69 /r /is4||Fused Multiply-Add of Packed Double-Precision Floating-Point Values||
|-
|VFMADDPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 68 /r /is4||Fused Multiply-Add of Packed Single-Precision Floating-Point Values||
|-
|VFMADDSD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6B /r /is4||Fused Multiply-Add of Scalar Double-Precision Floating-Point Values||
|-
|VFMADDSS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6A /r /is4||Fused Multiply-Add of Scalar Single-Precision Floating-Point Values||
|-
|VFMADDSUBPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 5D /r /is4||Fused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Values||
|-
|VFMADDSUBPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 5C /r /is4||Fused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Values||
|-
|VFMSUBADDPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 5F /r /is4||Fused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Values||
|-
|VFMSUBADDPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 5E /r /is4||Fused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Values||
|-
|VFMSUBPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6D /r /is4||Fused Multiply-Subtract of Packed Double-Precision Floating-Point Values||
|-
|VFMSUBPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6C /r /is4||Fused Multiply-Subtract of Packed Single-Precision Floating-Point Values||
|-
|VFMSUBSD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6F /r /is4||Fused Multiply-Subtract of Scalar Double-Precision Floating-Point Values||
|-
|VFMSUBSS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 6E /r /is4||Fused Multiply-Subtract of Scalar Single-Precision Floating-Point Values||
|-
|VFNMADDPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 79 /r /is4||Fused Negative Multiply-Add of Packed Double-Precision Floating-Point Values||
|-
|VFNMADDPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 78 /r /is4||Fused Negative Multiply-Add of Packed Single-Precision Floating-Point Values||
|-
|VFNMADDSD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7B /r /is4||Fused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values||
|-
|VFNMADDSS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7A /r /is4||Fused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values||
|-
|VFNMSUBPD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7D /r /is4||Fused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values||
|-
|VFNMSUBPS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7C /r /is4||Fused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values||
|-
|VFNMSUBSD xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7F /r /is4||Fused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values||
|-
|VFNMSUBSS xmm0, xmm1, xmm2, xmm3||C4E3 WvvvvL01 7E /r /is4||Fused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values||
|}
 
== Intel AES instructions ==
{{Main|AES instruction set}}
6 new instructions.
{| class="wikitable"
|-
! Instruction
! Description
|-
| AESENC
| Perform one round of an [[Advanced Encryption Standard|AES]] encryption flow
|-
| AESENCLAST
| Perform the last round of an AES encryption flow
|-
| AESDEC
| Perform one round of an AES decryption flow
|-
| AESDECLAST
| Perform the last round of an AES decryption flow
|-
| AESKEYGENASSIST
| Assist in AES round key generation
|-
| AESIMC
| Assist in AES Inverse Mix Columns
|}
 
== Undocumented instructions ==
 
The x86 CPUs contain [[Illegal opcode|undocumented instructions]] which are implemented on the chips but not listed in some official documents. They can be found in various sources across the Internet, such as [[Ralf Brown's Interrupt List]] and at http://sandpile.org.
 
{| class="wikitable"
! Mnemonic
! Opcode
! Description
! Status
|-
|AAM imm8
|D4 imm8
|Divide AL by imm8, put the quotient in AH, and the remainder in AL
|Available beginning with 8086, documented since Pentium (earlier documentation lists no arguments)
|-
|AAD imm8
|D5 imm8
|Multiplication counterpart of AAM
|Available beginning with 8086, documented since Pentium (earlier documentation lists no arguments)
|-
|SALC
|D6
|Set AL depending on the value of the Carry Flag (a&nbsp;1-byte alternative of SBB&nbsp;AL,&nbsp;AL)
|Available beginning with 8086, but only documented since Pentium Pro.
|-
|UD1
|0F B9
|Intentionally undefined instruction, but unlike UD2 this was not published
|-
|ICEBP
|F1
|Single byte single-step exception / Invoke [[In-circuit emulator|ICE]]
|Available beginning with 80386, documented (as INT1) since Pentium Pro
|-
|[[LOADALL]]
|0F 05
|Loads All Registers from Memory Address 0x000800H
|Only available on 80286
|-
|Unknown mnemonic
|0F 04
|Exact purpose unknown, causes CPU hang. (the only way out is CPU reset)<ref>{{cite web
| url = http://www.sandpile.org/post/msgs/20004129.htm
| title = Re: Undocumented opcodes (HINT_NOP)
| accessdate = 2010-11-07
}}</ref>
In some implementations, emulated through [[BIOS]] as a [[halt]]ing sequence.<ref>{{cite web
| url = http://www.sandpile.org/post/msgs/20003986.htm
| title = Re: Also some undocumented 0Fh opcodes
| accessdate = 2010-11-07
}}</ref>
|Only available on 80286
|-
|LOADALLD
|0F 07
|Loads All Registers from Memory Address ES:EDI
|Only available on 80386
|-
|POP CS
|0F
|Pop top of the stack into CS Segment register (causing a far jump)
|Only available on earliest models of 8086. Beginning with 80286 this opcode is used as a prefix for 2-Byte-Instructions
|-
|MOV CS,r/m
|8E/1
|Moves a value from register/memory into CS Segment register (causing a far jump)
|Only available on earliest models of 8086. Beginning with 80286 this opcode causes an invalid opcode exception
|-
|MOV ES,r/m
|8E/4
|Moves a value from register/memory into ES segment register
|Only available on earliest models of 8086. On 80286 this opcode causes an invalid opcode exception. Beginning with 80386 the value is moved into the FS segment register.
|-
|MOV CS,r/m
|8E/5
|Pop top of the stack into CS Segment register (?)
|Only available on earliest models of 8086. On 80286 this opcode causes an invalid opcode exception. Beginning with 80386 the value is moved into the GS segment register.
|-
|MOV SS,r/m
|8E/6
|Moves a value from register/memory into SS Segment register
|Only available on earliest models of 8086. Beginning with 80286 this opcode causes an invalid opcode exception
|-
|MOV DS,r/m
|8E/7
|Moves a value from register/memory into DS Segment register
|Only available on earliest models of 8086. Beginning with 80286 this opcode causes an invalid opcode exception
|}
 
==See also==
* [[CLMUL instruction set|CLMUL]]
* [[XOP instruction set|XOP]]
* [[F16C]]
* [[FMA instruction set|FMA]]
* [[RdRand]]
* [[Larrabee (microarchitecture)|Larrabee extensions]]
* [[Advanced Vector Extensions 2]]
* [[Bit Manipulation Instruction Sets]]
* [[CPUID]]
 
==References==
{{reflist}}
{{refbegin}}
* [http://www.intel.com/products/processor/manuals/ Intel Software Developer's Manuals]
{{refend}}
 
==External links==
{{wikibooks|x86 Assembly|X86 Instructions|X86 Instructions}}
* [http://web.archive.org/web/20100407092131/http://home.comcast.net/~fbui/intel.html The 8086 / 80286 / 80386 / 80486 Instruction Set]
* [http://www.intel.com/products/processor/manuals/index.htm Free IA-32 and x86-64 documentation], provided by Intel
* [http://www.nasm.us/doc/nasmdocb.html Netwide Assembler Instruction List] (from [[Netwide Assembler]])
* [http://ref.x86asm.net x86 Opcode and Instruction Reference]
 
{{x86 assembly topics}}
 
{{DEFAULTSORT:x86 Instruction Listings}}
[[Category:X86 instructions| ]]
[[Category:Instruction set listings]]

Latest revision as of 05:26, 15 December 2014

I'm a 34 years old, married and work at the college (Film Studies).
In my free time I learn Dutch. I've been there and look forward to returning sometime in the future. I love to read, preferably on my ipad. I like to watch Doctor Who Bookbyte Coupon 20% Off and Doctor Who as well as documentaries about anything technological. I like Games Club - Dungeons and Bookbyte Coupon Codes Dragons, Monopoly, Etc..

my webpage - Bookbyte Bonus Code