之前用tcpdump抓包的时候,只要是gzip压缩过的数据就没有办法直接还原原始的数据。这段时间学了一下python正好看里面有gzip模块。今天先尝试了一下解压web server返回的压缩过的数据。测试了一下OK
#!/usr/bin/env python
import urllib2
import gzip
import binascii
from StringIO import StringIO
def gunziptxt(data):
buf = StringIO(data)
of =gzip.GzipFile(fileobj=buf,mode="rb")
outdata=of.read()
return outdata
url="http://127.0.0.1/index.html"
request=urllib2.Request(url,headers={'User-agent':"python urllib browser","Accept-Encoding":'gzip'})
try:
response=urllib2.urlopen(request,timeout=5)
data=response.read()
except:
print "get %s response failed" %url
print "headers:\n",response.info()
if response.info()["content-encoding"] == 'gzip':
print "http resonse is gzip"
outdata=gunziptxt(data)
lbuf=StringIO()
with gzip.GzipFile(mode='wb',fileobj=lbuf) as inf:
inf.write(data)
gziplen=len(lbuf.getvalue())
print "gzip %d and gunzip %d"%(gziplen,len(outdata))
else:
print "http resonse is not gzip"
outdata=data
print "http response:\n",outdata
要获取压缩过的数据的长度只能先借助StingIO,把原始的输出放进去后才能用len来得到。