Lecture (Ⅲ): Floating Point

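This post centers on the IEEE floating-point standard. It covers the background on fractional binary numbers, the definition of the IEEE floating-point standard, examples and properties, then rounding, addition, and multiplication, and floating point in C. It closes with a summary and the additional slides.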


Contents

Background: Fractional binary numbers

IEEE floating point standard: Definition

Example and properties

Rounding, addition, multiplication

Floating point in C

Summary

Additional Slides


Background: Fractional binary numbers

p3---What is 1011.101(2)?
p4---Representation
p5

---Value Representation

---Observations

p6

---Limitation #1

---Limitation #2
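
To make the p3 question concrete, here is a minimal sketch that evaluates a binary string with a binary point. Bits to the left of the point weigh 2^i and bits to the right weigh 2^-j, so 1011.101 in base 2 is 8 + 2 + 1 + 1/2 + 1/8 = 11.625. The function name `binary_fraction_value` is hypothetical and the input is assumed to contain only '0', '1', and at most one '.'.

```c
/* Evaluate a binary string such as "1011.101" as a real number.
   Assumes the string holds only '0', '1', and at most one '.'. */
double binary_fraction_value(const char *s)
{
    double value = 0.0;
    double weight = 0.5;      /* weight of the first bit after the point */
    int seen_point = 0;

    for (; *s != '\0'; s++) {
        if (*s == '.') {
            seen_point = 1;
        } else if (!seen_point) {
            /* integer part: shift left by one bit, then add the new bit */
            value = value * 2.0 + (*s - '0');
        } else {
            /* fractional part: each bit is worth half of the previous one */
            value += (*s - '0') * weight;
            weight /= 2.0;
        }
    }
    return value;
}
```

Calling `binary_fraction_value("1011.101")` yields 11.625, and `binary_fraction_value("0.111")` yields 0.875, just below 1.0.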

IEEE floating point standard: Definition

p8

IEEE Floating Point
---IEEE Standard 754

---Driven by numerical concerns

p9

---Numerical Form

---Encoding

p10

Precision options

---Single precision: 32 bits

---Double precision: 64 bits

---Extended precision: 80 bits (Intel only)

p11“Normalized” Values
p12

Normalized Encoding Example

---Value: float F = 15213.0;

---Significand

---Exponent

---Result:
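
The encoding of 15213.0 can be checked by pulling the three fields out of the bit pattern. This is a sketch that assumes `float` is a 32-bit IEEE single-precision value; the helper names are hypothetical. Since 15213 = 1.1101101101101 (base 2) × 2^13, the stored exponent field should be 13 + 127 = 140.

```c
#include <stdint.h>
#include <string.h>

/* Copy the bytes of a float into a 32-bit integer.
   memcpy avoids the aliasing problems of a pointer cast. */
uint32_t float_bits(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);
    return u;
}

/* 1 sign bit, 8 exponent bits, 23 fraction bits (single precision). */
unsigned sign_field(uint32_t bits) { return bits >> 31; }
unsigned exp_field(uint32_t bits)  { return (bits >> 23) & 0xFFu; }
uint32_t frac_field(uint32_t bits) { return bits & 0x7FFFFFu; }
```

For `float_bits(15213.0f)` the pattern is 0x466DB400: sign 0, exponent field 140 (binary 10001100), and fraction field 0x6DB400, which is 1101101101101 followed by ten zero bits.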

p13Denormalized Values
p14Special Values
p15Visualization: Floating Point Encodings
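
The three cases on p11 to p14 are distinguished by the exponent field alone: all zeros means denormalized (or zero), all ones means a special value, anything else is normalized. A minimal sketch for single precision, with hypothetical type and function names:

```c
#include <stdint.h>

typedef enum {
    KIND_ZERO,      /* exp == 0,    frac == 0 */
    KIND_DENORM,    /* exp == 0,    frac != 0 */
    KIND_NORM,      /* exp neither all zeros nor all ones */
    KIND_INF,       /* exp == 0xFF, frac == 0 */
    KIND_NAN        /* exp == 0xFF, frac != 0 */
} fp_kind;

/* Classify a single-precision bit pattern; the sign bit is ignored. */
fp_kind classify_bits(uint32_t bits)
{
    uint32_t exp  = (bits >> 23) & 0xFFu;
    uint32_t frac = bits & 0x7FFFFFu;

    if (exp == 0)
        return frac == 0 ? KIND_ZERO : KIND_DENORM;
    if (exp == 0xFFu)
        return frac == 0 ? KIND_INF : KIND_NAN;
    return KIND_NORM;
}
```

Both 0x00000000 and 0x80000000 classify as zero (+0 and -0), 0x3F800000 (1.0) is normalized, and 0x7F800000 is +infinity.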

Example and properties

p17

Tiny Floating Point Example
---8-bit Floating Point Representation

---Same general form as IEEE Format

p18Dynamic Range (Positive Only)
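
The dynamic range of the tiny format can be explored with a small decoder. The field layout is an assumption here, since the post does not spell it out: 1 sign bit, 4 exponent bits with bias 7, and 3 fraction bits. `decode_tiny` is a hypothetical name.

```c
#include <math.h>     /* only for the INFINITY and NAN macros */
#include <stdint.h>

/* Decode an 8-bit tiny float.
   Assumed layout: s (1 bit) | exp (4 bits, bias 7) | frac (3 bits). */
double decode_tiny(uint8_t bits)
{
    int sign = (bits >> 7) & 1;
    int exp  = (bits >> 3) & 0xF;
    int frac = bits & 0x7;
    int E;
    double value;

    if (exp == 0xF) {
        /* all-ones exponent: infinity when frac is 0, otherwise NaN */
        value = (frac == 0) ? INFINITY : NAN;
    } else {
        if (exp == 0) {
            E = 1 - 7;                  /* denormalized: no implied leading 1 */
            value = frac / 8.0;
        } else {
            E = exp - 7;                /* normalized: implied leading 1 */
            value = 1.0 + frac / 8.0;
        }
        for (; E > 0; E--) value *= 2.0;   /* scale by 2^E without libm */
        for (; E < 0; E++) value /= 2.0;
    }
    return sign ? -value : value;
}
```

Under this layout the smallest positive value is `decode_tiny(0x01)` = 1/512, the largest denormalized value is `decode_tiny(0x07)` = 7/512, the smallest normalized value is `decode_tiny(0x08)` = 8/512, and the largest finite value is `decode_tiny(0x77)` = 240.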
p19

Distribution of Values
---6-bit IEEE-like format

---Notice how the distribution gets denser toward zero.

p20Distribution of Values (close-up view)
---6-bit IEEE-like format
p21

Special Properties of the IEEE Encoding
---FP Zero Same as Integer Zero

---Can (Almost) Use Unsigned Integer Comparison
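
Both properties can be observed from the bit patterns. The sketch below assumes 32-bit IEEE single precision and uses hypothetical helper names. The "almost" matters: the comparison is only valid as written for non-negative, non-NaN inputs, and -0 has a different bit pattern from +0 even though the two compare equal as floats.

```c
#include <stdint.h>
#include <string.h>

/* Reinterpret the bytes of a float as an unsigned 32-bit integer. */
uint32_t bits_of(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);
    return u;
}

/* Returns 1 when a < b, using only an unsigned integer comparison.
   Valid only when both a and b are non-negative and neither is NaN. */
int less_than_nonneg(float a, float b)
{
    return bits_of(a) < bits_of(b);
}
```

`bits_of(0.0f)` is 0, the same as integer zero, while `bits_of(-0.0f)` is 0x80000000. For non-negative inputs, `less_than_nonneg(1.0f, 2.0f)` is 1 and `less_than_nonneg(1e30f, 1e-30f)` is 0, matching the ordinary floating-point comparison.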

Rounding, addition, multiplication

p23Floating Point Operations: Basic Idea
p24Rounding Modes (illustrate with $ rounding)
p25

Closer Look at Round-To-Even
---Default Rounding Mode

---Applying to Other Decimal Places / Bit Positions

p26

Rounding Binary Numbers
---Binary Fractional Numbers

---Examples
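
Round-to-even on binary fractions can be sketched with plain integer arithmetic. The value is held as a fixed-point integer, the low bits are dropped, and the dropped bits decide the direction: more than half rounds up, less than half rounds down, and exactly half rounds so that the kept value ends in 0. The function name and the sample values are illustrative choices, not taken from the post.

```c
/* Drop the low `drop` bits of x, rounding to nearest with ties to even.
   Requires 1 <= drop < number of bits in unsigned. */
unsigned round_to_even(unsigned x, unsigned drop)
{
    unsigned kept = x >> drop;                 /* bits that survive */
    unsigned rem  = x & ((1u << drop) - 1u);   /* bits being discarded */
    unsigned half = 1u << (drop - 1u);         /* exactly halfway */

    if (rem > half || (rem == half && (kept & 1u)))
        kept++;                                /* round up; on a tie only if kept is odd */
    return kept;
}
```

With inputs in units of 1/32 and results in units of 1/4 (`drop` = 3): 10.00011 (67) rounds down to 10.00 (8), 10.00110 (70) rounds up to 10.01 (9), the tie 10.11100 (92) rounds up to 11.00 (12), and the tie 10.10100 (84) rounds down to 10.10 (10).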

p27FP Multiplication
p28Floating Point Addition
p29

Mathematical Properties of FP Add

---Compare to those of Abelian Group

---Monotonicity
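
The property that fails for FP add is associativity, because each intermediate sum is rounded. A minimal sketch, assuming IEEE single precision; the function names are hypothetical, and `volatile` is used only to force each intermediate result to be stored as a `float`.

```c
/* Computes (a + b) + c, rounding the inner sum to single precision. */
float add_left_first(float a, float b, float c)
{
    volatile float t = a + b;
    return t + c;
}

/* Computes a + (b + c), rounding the inner sum to single precision. */
float add_right_first(float a, float b, float c)
{
    volatile float t = b + c;
    return a + t;
}
```

With a = 3.14, b = 1e10, c = -1e10, `add_left_first` returns 0 because 3.14 is lost when it is added to 1e10 (adjacent floats near 1e10 are 1024 apart), while `add_right_first` returns 3.14.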

p30

Mathematical Properties of FP Mult
---Compare to Commutative Ring

---Monotonicity

Floating point in C

p32

Floating Point in C
---C Guarantees Two Levels

---Conversions/Casting
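
The conversion rules can be probed with a few casts. This sketch assumes a 32-bit `int`, IEEE single-precision `float` (24-bit significand), and IEEE double-precision `double` (53-bit significand); the function names are hypothetical.

```c
/* double -> int discards the fractional part (rounds toward zero). */
int double_to_int(double d)
{
    return (int)d;
}

/* Returns 1 when x survives int -> float -> int unchanged.
   A float holds only 24 significant bits, so large ints may be rounded.
   Caution: for x close to INT_MAX the rounded float can exceed the int
   range, and converting it back is undefined behavior. */
int survives_float(int x)
{
    volatile float f = (float)x;
    return (int)f == x;
}

/* Returns 1 when x survives int -> double -> int unchanged.
   A double holds 53 significant bits, enough for any 32-bit int. */
int survives_double(int x)
{
    volatile double d = (double)x;
    return (int)d == x;
}
```

`double_to_int(3.99)` is 3 and `double_to_int(-3.99)` is -3. The value 16777216 (2^24) survives the trip through `float`, but 16777217 does not, because it rounds to 16777216; both survive the trip through `double`.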

p33Floating Point Puzzles
---For each of the following C expressions, either:

Summary

p34

---IEEE Floating Point has clear mathematical properties

---Represents numbers of form M × 2^E

---One can reason about operations independent of

---Not the same as real arithmetic

Additional Slides

p36

Creating Floating Point Number
---Steps

---Case Study

p37Normalize
---Requirement
p38Rounding
---Round up conditions
p39Postnormalize
---Issue
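
The normalize, round, and postnormalize steps on p36 to p39 can be sketched as one function that converts an 8-bit unsigned integer to a tiny float. The target layout is an assumption (1 sign bit, 4 exponent bits with bias 7, 3 fraction bits), rounding is round-to-even, and `encode_tiny` is a hypothetical name.

```c
#include <stdint.h>

/* Convert an 8-bit unsigned integer to the assumed 8-bit tiny float
   layout: s (1 bit) | exp (4 bits, bias 7) | frac (3 bits). */
uint8_t encode_tiny(uint8_t v)
{
    int E = 7;               /* exponent if the leading 1 is already at bit 7 */
    unsigned m = v;
    unsigned kept, rem;

    if (v == 0)
        return 0;            /* zero is encoded as all zero bits */

    /* Normalize: shift left until the leading 1 sits at bit 7,
       lowering the exponent by one for each shift. */
    while ((m & 0x80u) == 0) {
        m <<= 1;
        E--;
    }

    /* Round: keep the leading 1 plus 3 fraction bits; the low 4 bits
       decide the direction (ties go to the even kept value). */
    kept = m >> 4;
    rem  = m & 0xFu;
    if (rem > 8u || (rem == 8u && (kept & 1u)))
        kept++;

    /* Postnormalize: rounding may carry out to 10.000, so shift right
       once and bump the exponent. */
    if (kept == 16u) {
        kept = 8u;
        E++;
    }

    /* Pack the biased exponent and the 3 fraction bits (leading 1 dropped). */
    return (uint8_t)(((unsigned)(E + 7) << 3) | (kept & 7u));
}
```

Under these assumptions 128 encodes as 0x70 (exactly 128), 17 as 0x58 (a tie that rounds down to 16), 19 as 0x5A (a tie that rounds up to 20), 138 as 0x71 (144), and 63 as 0x68, where rounding 1.1111100 up produces 10.000 and postnormalizing gives 1.000 × 2^6 = 64. Inputs of 248 and above round past the largest finite value (240) and encode as 0x78, the infinity pattern.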
p40Interesting Numbers
 

 
