python高级文件和爬虫
文件后续
转换文字:分辨文件的编码格式
num = open(r'C:\Users\悠然\Desktop\练字\python.txt','r+',encoding='gbk')
text = num.read()
print(text)
转换文字:分辨文件的编码格式
num = open(r'C:\Users\悠然\Desktop\练字\python.txt','r+',encoding='gbk')
text = num.read()
print(text)
如图:
循环读 以行为单位
num = open(r'C:\Users\悠然\Desktop\练字\新建文本文档.txt','r+',encoding='gbk')
text = num.readline()
while len(text)!=0:
print(text,end='')
#再读取一行 下一行
text = num.readline()
num = open(r'C:\Users\悠然\Desktop\练字\新建文本文档.txt','r+',encoding='gbk')
text = num.readline()
while len(text)!=0:
print(text,end='')
#再读取一行 下一行
text = num.readline()
如图:
复制 并替换
foo = open(r'C:\Users\悠然\Desktop\练字\python.txt','r+',encoding='gbk')
num = open(r'C:\Users\悠然\Desktop\练字\python2.txt','w+',encoding='gbk')
lines = foo.readlines()
print(lines)
for li in lines:
num.write(li.replace('哈哈哈','XXX'))
num.close()
foo.close()
print("复制 并替换完毕")
foo = open(r'C:\Users\悠然\Desktop\练字\python.txt','r+',encoding='gbk')
num = open(r'C:\Users\悠然\Desktop\练字\python2.txt','w+',encoding='gbk')
lines = foo.readlines()
print(lines)
for li in lines:
num.write(li.replace('哈哈哈','XXX'))
num.close()
foo.close()
print("复制 并替换完毕")
如图:
小小爬虫
from urllib import request
请求打开一个网页
response = request.urlopen('https://www.taobao.com')
在本地 创建一个文本
xml = open('python_xml.html','w+',encoding='utf-8')
把服务器的相应 处理
html = response.read()
xml.write(str(html.decode('utf-8')))
print('读取完毕')
如图:
图二:
完毕!