id	summary	reporter	owner	description	type	status	priority	milestone	component	version	resolution	keywords	cc
538	HTMLParser fails if a multi-byte character falls on a 4K boundary	hodgestar	hodgestar	"If one does:

{{{
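from io import BytesIO
from genshi.input import HTMLParser  # assumed: Genshi's HTMLParser
# 4095 ASCII bytes followed by one two-byte UTF-8 character
text = u'a' * ((4 * 1024) - 1) + u'\xe6'
events = list(HTMLParser(BytesIO(text.encode('utf-8')),
                         encoding='utf-8'))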
}}}

it produces a truncated-input error because the two-byte UTF-8 encoding of u'\xe6' straddles the 4K boundary of a read from the input file: the first 4096-byte read ends after only the first byte of the character."	defect	closed	major	0.7	Parsing	devel	fixed		
