Wireshark · Wireshark-dev: Re: [Wireshark-dev] Wireshark memory handling

Wireshark-dev: Re: [Wireshark-dev] Wireshark memory handling

From: Guy Harris <guy@xxxxxxxxxxxx>

Date: Fri, 9 Oct 2009 19:08:08 -0700


On Oct 9, 2009, at 7:43 AM, Jeff Morriss wrote:

One advantage of using memory mapped files instead of swap is that if
your OS is swapping, *everything* is slow.  If only Wireshark is, er,
swapping, only Wireshark is slow.

That depends on the OS's policies for managing main memory - and onany policy hints given to the OS by the application. If, for example,when it searches for a page frame to use to satisfy a page fault, ituses the same policy when servicing a page fault for a page backed bya mapped file and when servicing a page fault for a page backed byswap space (an "anonymous" page), the only advantage to memory mappingwould be

1) if the file is mapped into multiple process's address spaces (andeither read-only or not copy-on-write), those processes can share asingle page frame for a page from the file - but that's not the casehere, as I understand it;

2) if the data in anonymous pages is a copy of data from a file,memory-mapping the file even in only one process means that you don'teven temporarily have two copies of the data in memory.

Using memory mapped files would probably help quite a bit with keeping
the UI responsive because only Wireshark's, for example, packet data
would be on disk but the executable pages and "core" memory like the

statistics could be kept in RAM (or at least whatever the OS givesus).

As per my mail to Erlend, the frame data isn't kept in Wireshark'saddress space, although reassembled data is (and frame_data structuresare, and some or all column text is).

However, if Wireshark reads a large capture file, on many OSes theblocks of the file will be brought into the page pool (as, on manyOSes, the "buffer cache" is implemented atop the page pool, so pagesbeing read in with read()/ReadFile() compete for memory with pagesfaulted in - it may even be that a read is done by mapping into thekernel's address space the region of the file being read and copyingfrom that region into the userland buffer space, so that the actualfile system reads are done in response to page faults). *Hopefully*the OS will recognize it as sequential access and, at least, notcompletely blow the page cache if the file is big enough (although, ifyou have enough memory that you *don't* blow the page cache, you mightas well keep the pages in memory; my menagerie of capture files I usefor Wireshark/tcpdump regression testing for some changes can fitentirely in main memory on my machine, so if I run the tests twice ina row, the disk hardly does anything).

References:
- [Wireshark-dev] Wireshark memory handling
  - From: Håvar Aambø Fosstveit
- Re: [Wireshark-dev] Wireshark memory handling
  - From: Erlend Hamberg
- Re: [Wireshark-dev] Wireshark memory handling
  - From: didier
- Re: [Wireshark-dev] Wireshark memory handling
  - From: Erlend Hamberg
- Re: [Wireshark-dev] Wireshark memory handling
  - From: Jeff Morriss

Prev by Date: Re: [Wireshark-dev] Wireshark memory handling
Next by Date: Re: [Wireshark-dev] g_mem-chunk_destroy does not free memory for GLib > 2.8
Previous by thread: Re: [Wireshark-dev] Wireshark memory handling
Next by thread: Re: [Wireshark-dev] Wireshark memory handling
Index(es):
- Date
- Thread