Bit Fields, Byte Order, Bit Order, and Endianness in C

admin • 2025-09-17 03:17:05 • Website Building • 24 views

Category: C/C++

1. Bit order / bit numbering / bit endianness
A byte has 8 bits, numbered from bit 0 to bit 7. Bit order describes which position in the byte each bit number refers to. The Wikipedia article at http://en.wikipedia/wiki/Bit_numbering leads to the following conclusions:
(1) There are two bit-numbering conventions: LSB 0 and MSB 0. LSB stands for least significant bit and MSB for most significant bit. Under LSB 0, bit 0 of the byte holds the least significant bit of the data. Under MSB 0, bit 0 of the byte holds the most significant bit of the data.
Consider the code: unsigned char ch = 0x96; // 0x96 = 1001 0110
Which bit of ch is bit 0? C pointers address whole bytes, not individual bits, so the only question is which bit the numbering calls bit 0. Under LSB 0 it is the rightmost, least significant bit, which is 0. Under MSB 0 it is the leftmost, most significant bit, which is 1.
Figure (LSB 0): a container for an 8-bit binary number, with the highlighted least significant bit assigned bit number 0.
Figure (MSB 0): a container for an 8-bit binary number, with the highlighted most significant bit assigned bit number 0.
(2) Little-endian CPUs usually employ "LSB 0" bit numbering, however both bit numbering conventions can be seen in big-endian machines.
(3) The recommended style for Request for Comments (RFC) documents is "MSB 0" bit numbering.
(4) Bit numbering is usually transparent to the software.
2. Endianness and byte order
http://en.wikipedia/wiki/Endianess
In computing, the term endian or endianness refers to the ordering of individually addressable sub-components within the representation of a larger data item as stored in external memory (or, sometimes, as sent on a serial connection). Each sub-component in the representation has a unique degree of significance, like the place value of digits in a decimal number. These sub-components are typically 16- or 32-bit words, 8-bit bytes, or even bits. Endianness is a difference in data representation at the hardware level and may or may not be transparent at higher levels, depending on factors such as the type of high level language used.
In other words, a larger data item in memory is made up of parts that can each be addressed separately, and whether the CPU is big-endian or little-endian determines the order in which those parts are stored.
The most common cases refer to how bytes are ordered within a single 16-, 32-, or 64-bit word. Whenever code accesses only part of a 16-, 32-, or 64-bit value, that is, one or a few of its bytes, the endianness of the machine must be taken into account.
A big-endian machine stores the most significant byte first, and a little-endian machine stores the least significant byte first.

Quick Reference - Byte Machine Example
Endian	First Byte (lowest address)	Middle Bytes	Last Byte (highest address)	Summary
big	most significant	...	least significant	Similar to a number written on paper (in Arabic numerals)
little	least significant	...	most significant	Arithmetic calculation order (see carry propagation)

Examples of storing the value 0A0B0C0Dh in memory

Big-endian

Atomic element size 8-bit, address increment 1-byte (octet)

increasing addresses →
...	0Ah	0Bh	0Ch	0Dh	...

The most significant byte (MSB) value, which is 0Ah in our example, is stored at the memory location with the lowest address, the next byte value in significance, 0Bh, is stored at the following memory location and so on. This is akin to Left-to-Right reading in hexadecimal order.

Atomic element size 16-bit

increasing addresses →
...	0A0Bh	0C0Dh	...

The most significant atomic element now stores the value 0A0Bh, followed by 0C0Dh.

Little-endian

Atomic element size 8-bit, address increment 1-byte (octet)

increasing addresses →
...	0Dh	0Ch	0Bh	0Ah	...

The least significant byte (LSB) value, 0Dh, is at the lowest address. The other bytes follow in increasing order of significance.

Atomic element size 16-bit

increasing addresses →
...	0C0Dh	0A0Bh	...

The least significant 16-bit unit stores the value 0C0Dh, immediately followed by 0A0Bh. Note that 0C0Dh and 0A0Bh represent integers, not bit layouts (see bit numbering).

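Little-endian machines clearly follow the "high to high, low to low" rule: the more significant byte or word is stored at the higher address, and the less significant byte or word at the lower address. In addition, when memory is drawn with addresses increasing from right to left, a little-endian value in memory reads in the same order as the value held in a CPU register.
Byte addresses increasing from right to left
When we write 0xFF86, the low-order digits are on the right and the high-order digits on the left, so on a little-endian machine the byte addresses increase from right to left. When we write a string such as char *str = "Hello world!", it is the other way round: the character at the lowest address is written on the left and the character at the highest address on the right.
With 8-bit atomic elements: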

← increasing addresses
...	0Ah	0Bh	0Ch	0Dh	...

The least significant byte (LSB) value, 0Dh, is at the lowest address. The other bytes follow in increasing order of significance. (This clearly matches the way we normally write numbers.)

With 16-bit atomic elements:

← increasing addresses
...	0A0Bh	0C0Dh	...

The least significant 16-bit unit stores the value 0C0Dh, immediately followed by 0A0Bh.

The display of text is reversed from the normal display of languages such as English that read from left to right. For example, the word "XRAY" displayed in this manner, with each character stored in an 8-bit atomic element:

← increasing addresses
...	"Y"	"A"	"R"	"X"	...

(Note that this is the reverse of the order in which we write text by hand. This point deserves particular attention.)

If pairs of characters are stored in 16-bit atomic elements (using 8 bits per character), it could look even stranger:

← increasing addresses
...	"AY"	"XR"	...

A related C example:

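#include <stdio.h>
#include <string.h>

int main(void)
{
    char a[] = {'a', 'b', 'c'};   /* 3 bytes, no terminating '\0' */
    char b[] = {'d', 'e', 'f'};   /* 3 bytes, no terminating '\0' */

    /* Deliberate out-of-bounds write: undefined behavior, kept only to
       expose how this particular compiler laid out the stack. */
    a[3] = 0;

    /* strlen and %s read past the end of the arrays (also undefined). */
    printf("strlen(a)=%zu, strlen(b)=%zu\n", strlen(a), strlen(b));
    printf("a=%s, b=%s\n", a, b);
    /* sizeof yields a size_t, so the matching conversion is %zu, not %d. */
    printf("sizeof(a)=%zu, sizeof(b)=%zu\n", sizeof(a), sizeof(b));
    return 0;
}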

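Output:
strlen(a)=3, strlen(b)=6
a=abc, b=defabc
sizeof(a)=3, sizeof(b)=3
Analysis: both character arrays a and b are allocated on the stack, a first. Array elements are always stored at increasing addresses, whatever the byte order of the CPU: 'a' is at the lowest address, 'b' in the middle, and 'c' at the highest. This matches the way strings are written, with the character at the lowest address on the left. Byte order only affects the bytes inside a single multi-byte object, so it plays no part in this result. The stack grows from high addresses toward low addresses, so in this run b, which was allocated after a, ended up directly below a. The statement a[3] = 0; then writes to the address just above 'c'. After that statement the stack looks like this:
high address <<---- low address
\0 c b a f e d
Printing a therefore gives abc, and printing b gives defabc. strlen computes the length of a string by scanning until it finds the terminating '\0', so the length of a is 3 and the length of b is 6. sizeof does not look for a terminator; it reports the size of the array type. Writing a[3] and reading past the end of either array is undefined behavior in C, so this output depends on the compiler and its options and is not guaranteed.
Example 2: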

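#include <stdio.h>
#include <stdint.h>
#include <string.h>

int main(void)
{
    /* uint32_t keeps every element at 4 bytes; unsigned long is 8 bytes
       on many 64-bit systems, which would change the result. */
    uint32_t array[] = {0x12345678, 0xabcdef01, 0x456789ab};
    uint16_t ret;

    /* Copy the two bytes at offsets 7 and 8. memcpy avoids an unaligned,
       type-punned load through a cast pointer. */
    memcpy(&ret, (unsigned char *)array + 7, sizeof ret);
    printf("0x%x\n", (unsigned)ret);
    return 0;
}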

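With 4-byte array elements, the result on a little-endian CPU is 0xabab. On a big-endian CPU it should be 0x145: the byte at offset 7 is 01h, the last byte of 0xabcdef01, and the byte at offset 8 is 45h, the first byte of 0x456789ab.
Example 3: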

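#include <stdio.h>

int main(void)
{
    int a[5] = {1, 2, 3, 4, 5};
    /* &a has type int (*)[5], so &a + 1 points just past the whole array. */
    int *ptr = (int *)(&a + 1);

    /* *(a + 1) is a[1]; *(ptr - 1) is the last element, a[4]. */
    printf("%d,%d\n", *(a + 1), *(ptr - 1));
    return 0;
}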

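Result: 2,5 (this example has nothing to do with endianness).
There are several ways to determine whether a CPU is big-endian or little-endian. Method 1: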

#include <stdio.h>
#include <assert.h>

int main(void)
{
    unsigned short x = 0xff01;
    /* Inspect the byte stored at the lowest address of x. */
    unsigned char first = *(unsigned char *)&x;

    assert(sizeof(x) >= 2);
    if (first == 0x01)
        printf("little-endian\n");
    else if (first == 0xff)
        printf("big-endian\n");
    else
        printf("unknown\n");
    return 0;
}

Method 2:

#include <stdio.h>

int main(void)
{
    union {
        unsigned char c;
        unsigned short i;   /* assumed to be 2 bytes, so 0x0201 fills all of i */
    } u;

    u.i = 0x0201;
    if (u.c == 1)
        printf("little-endian\n");
    else if (u.c == 2)
        printf("big-endian\n");
    else
        printf("unknown\n");
    return 0;
}

3. Bit fields in C
Let us start with a few examples:

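#include <stdio.h>

union u {
    struct {
        /* unsigned char is an implementation-defined bit-field type;
           GCC accepts it. Unsigned fields avoid sign surprises. */
        unsigned char i:1;
        unsigned char j:2;
        unsigned char m:3;
    } s;
    unsigned char c;
} r;

int main(void)
{
    r.s.i = 1; /* 1   */
    r.s.j = 2; /* 10  */
    r.s.m = 3; /* 011 */
    printf("0x%x\n", r.c);
    return 0;
}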

Build and run:
gcc -o union union.c
./union
Result: 0x1d (== 0001 1101 == 011 10 1)

#include <stdio.h>

union {
    struct {
        unsigned char a1:2;
        unsigned char a2:3;
        unsigned char a3:3;
    } x;
    unsigned char b;
} d;

int main(int argc, char *argv[])
{
    d.b = 100; /* 100 == 0110 0100 */
    printf("0x%x\n0x%x\n0x%x\n", d.x.a1, d.x.a2, d.x.a3);
    return 0;
}

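Build: gcc -o union2 union2.c
Result:
0x0 (== 00)
0x1 (== 001)
0x3 (== 011)
Both results suggest that on a little-endian machine the bit fields declared first occupy the low-order bits of the data and the bit fields declared later occupy the high-order bits. This also appears consistent with the convention that little-endian CPUs usually use LSB 0 bit numbering. One question remains, however: what do these two examples print on a big-endian CPU? Is the result the same as on a little-endian CPU, and is it unique? As noted earlier, a big-endian CPU may use either LSB 0 or MSB 0 bit numbering. The C standard leaves the order in which bit fields are allocated within a storage unit implementation-defined, so the answer depends on the compiler as well as the CPU.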

Published by admin. Please credit the source when reposting: http://www.yc00.com/web/1754939964a5217969.html
