Wireshark-bugs: [Wireshark-bugs] [Bug 2588] New: Bad UTF-8 string from RSS dissector
Date: Wed, 4 Jun 2008 15:50:24 -0700 (PDT)
https://bugs.wireshark.org/bugzilla/show_bug.cgi?id=2588

           Summary: Bad UTF-8 string from RSS dissector
           Product: Wireshark
           Version: 1.0.0
          Platform: PC
        OS/Version: Windows Server 2003
            Status: NEW
          Severity: Normal
          Priority: Medium
         Component: Wireshark
        AssignedTo: wireshark-bugs@xxxxxxxxxxxxx
        ReportedBy: bjoern@xxxxxxxxxxxx


Build Information:
Paste the COMPLETE build information from "Help->About Wireshark", "wireshark
-v", or "tshark -v".
--
Capturing the traffic generated from loading
http://www.apple.com/home/2008/ticker.rss into a PDML file produces an
ill-formed XML document, it has some octets that are not proper UTF-8 sequences
(the RSS document appears to be okay), one example is

...<field name="rss.channel.item.title" showname="\342\200\234Get
Smart\342\200\235 at the Apple Retail Store, Third Street Promenade" size="65"
pos="894" show="\x80Get Smart[raw 0xE2 here]...

The original octet sequence in the RSS document is E2 80 9C Get Smart E2 80 9D.


-- 
Configure bugmail: https://bugs.wireshark.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.