Blocks, block devices and block sizes
A block device is a device you can read blocks from. For example hard disks, cdrom drives and floppies are block devices, but not the keyboard. You can receive data from the keyboard and regard them as blocks, but you cannot seek on the keyboard. You can tell a hard disk "give me block 5433", then block 7707, then block 1807 and you cannot do this with a keyboard, so, a keyboard is no block device.
Block sizes
It is important to understand the ideas behind the block sizes.
Disk block sizes
Typically, a hard disk cannot read less than 512 bytes, if you want to read less, read 512 bytes and discard the rest. This is why dd reads one block à 512 bytes in the following example:
tweedleburg:~ # dd if=/dev/sda1 of=/dev/null count=1 1+0 records in 1+0 records out 512 bytes (512 B) copied, 1.8977e-05 s, 27.0 MB/s
File system block sizes
On the other hand, every file system needs to split up a partition into blocks to store files and file parts. This is why there is a different block size for a file system as you can see here:
$ stat -f . File: "." ID: 84bc409db75bdd32 Namelen: 255 Type: ext2/ext3 Block size: 4096 Fundamental block size: 4096 Blocks: Total: 2531650 Free: 1704179 Available: 1570650 Inodes: Total: 647168 Free: 567361
So, if you store a file in this file system, it will be stored in a 4096-byte-block, that means, even if your file only contains 5 bytes, it will take away 4096 bytes from your disk's capacity:
$ df . Filesystem 1K-blocks Used Available Use% Mounted on /dev/sda1 10126600 3309884 6282600 35% / $ echo hello>world thorsten_staerk_de@instance-2:~$ ls -l world -rw-r--r-- 1 thorsten_staerk_de thorsten_staerk_de 6 Apr 1 06:17 world thorsten_staerk_de@instance-2:~$ df . Filesystem 1K-blocks Used Available Use% Mounted on /dev/sda1 10126600 3309888 6282596 35% / $ du -sch world 4.0K world 4.0K total
Kernel block size
Also the kernel has its own block size. This is relevant e.g. for vmstat. In the vmstat man page you find the statement
All linux blocks are currently 1024 bytes.
So, again another block size when you work with vmstat. This is the block size the Linux kernel uses internally for caching and buffering. It is the most prominent of all block sizes.
Applications
vmstat
vmstat uses 1k blocks
iostat
iostat uses 512B blocks
I/O
There is
- buffered I/O and direct I/O
- async I/O and sync I/O
The difference between buffered I/O and direct I/O is that direct I/O does not use the operating system cache. During the open syscall, you can say that you want direct I/O. More info...
The difference between async I/O and sync I/O is that async I/O reads/writes from/to a file without waiting for the data to actually arrive, but rather having the data sent to a network socket. You must use libaio for this. More info...
How to...
Find out the size of a block device
To find out the size of the block device /dev/sda:
fdisk -l /dev/sda
or, a bit harder to read:
hwinfo --block
Find out the file system stored on a block device
A file system will typically be stored in a partition, not directly in a block device. To find all partitions on /dev/sda use
fdisk -l /dev/sda
To find out what file system is stored on /dev/sda1 use
file -s /dev/sda1
Find out the elevator algorithm
The elevator algorithm can be found like this:
cat /sys/devices/platform/host7/session1/target7\:0\:0/7\:0\:0\:0/block/sdi/queue/scheduler noop anticipatory [deadline] cfq
In this case, the deadline elevator algorithm is active.
Or it may be found like this:
cat /sys/class/block/sdb/queue/scheduler noop deadline [cfq]
You can switch the scheduler like this:
# cat /sys/class/block/sdb/queue/scheduler noop deadline [cfq] # echo "noop">/sys/class/block/sdb/queue/scheduler # cat /sys/class/block/sdb/queue/scheduler [noop] deadline cfq
Associated commands
- dd
- vmstat
- iostat
- stat
- fdisk
- hwinfo --block
- hwinfo --partition
- file -s /dev/sda : finds out the filesystem type of /dev/sda