Tutorial Week 7 : COMP3231/9201/3891/9283 Operating Systems 2025/T2

Tutorial Week 7

Why does Linux pre-allocate up to 8 blocks on a write to a file.

Pre-allocating provides better locality when many writes to independent files are interleaved.

What is the structure of the contents of a directory? Does it contain attributes such as creation times of files? If not, where might this information be stored?
- See lecture slides.
- No, directories only have a name-to-inode mapping
- Attributes of the file are stored in the inode itself.

The Unix inode structure contains a reference count. What is the reference count for? Why can't we just remove the inode without checking the reference count when a file is deleted?

Inodes contain a reference count due to hard links. The reference count is equal to the number of directory entries that reference the inode. For hard-linked files, multiple directory entries reference a single inode. The inode must not be removed until no directory entries are left (ie, the reference count is 0) to ensure that the filesystem remains consistent.

Inode-based filesystems typically divide a file system partition into block groups. Each block group consists of a number of contiguous physical disk blocks. Inodes for a given block group are stored in the same physical location as the block groups. What are the advantages of this scheme? Are they any disadvantages?
- Each group contains a redundant superblock. This make the file system more robust to disk block failures.
- Block groups keep the inodes physically closer to the files they refer to than they would be (on average) on a system without block groups. Since accessing and updating files also involves accessing or updating its inode, having the inode and the file's block close together reduces disk seek time, and thus improves performance. The OS must take care that all blocks remain within the block group of their inode.

A typical UNIX inode stores both the file's size and the number of blocks currently used to store the file. Why store both? Should not blocks = size / block size?
Blocks used to store the file are only indirectly related to file size.
- The blocks used to store a file includes and indirect blocks used by the filesystem to keep track of the file data blocks themselves.
- File systems only store blocks that actually contain file data. Sparsely populated files can have large regions that are unused within a file.

How does adding journalling to a file system avoid corruption in the presence of unexpected power failures.

Simply speaking, adding a journal addresses the issue by grouping file system updates into transactions that should either completely fail or succeed. These transactions are logged prior to manipulating the file system. In the presence of failure the transaction can be completed by replaying the updates remaining in the log.