Python文件操作方法与技巧详解

Python 提供了许多内置方法用于操作文件对象，使得文件读写和管理更加方便。这些方法可以对文件进行读取、写入、定位、追加等操作。

以下是 文件对象常用方法 的详细讲解和示例：

1. 打开和关闭文件

1.1 打开文件：`open()`

open() 是 Python 打开文件的内置函数，返回一个文件对象。

语法

open(file, mode='r', encoding=None)

参数

file：文件路径（相对路径或绝对路径）。

mode：文件打开模式（如 'r', 'w', 'a' 等）。

encoding：文本文件的编码方式（常用 utf-8 或 gbk）。

1.2 关闭文件：`close()`

close() 方法用于关闭文件。一旦文件被关闭，无法再读取或写入。

示例

file = open("example.txt", "r")
content = file.read()
print(content)
file.close()  # 关闭文件

注意：推荐使用 with 管理文件，自动关闭文件。

with open("example.txt", "r") as file:
    content = file.read()
    print(content)
# 文件在 with 代码块结束后自动关闭

2. 文件读取方法

2.1 `read(size)`

读取文件的内容，返回字符串。

参数 size 指定要读取的字符数。如果省略，则读取整个文件。

示例

with open("example.txt", "r") as file:
    content = file.read(10)  # 读取前 10 个字符
    print(content)

2.2 `readline()`

读取文件中的一行，返回字符串。

如果文件指针到达文件末尾，则返回空字符串。

示例

with open("example.txt", "r") as file:
    line = file.readline()
    print(line.strip())  # 输出第一行内容

2.3 `readlines()`

读取文件的所有行，返回一个包含每行字符串的列表。

示例

with open("example.txt", "r") as file:
    lines = file.readlines()
    print(lines)  # 输出：['第一行\n', '第二行\n', ...]

2.4 文件迭代

文件对象本身是可迭代的，可以逐行遍历文件内容。

示例

with open("example.txt", "r") as file:
    for line in file:
        print(line.strip())  # 输出每一行

3. 文件写入方法

3.1 `write(string)`

将字符串写入文件。

如果文件已存在，模式 'w' 会清空文件内容。

示例：写入文件

with open("example.txt", "w") as file:
    file.write("第一行内容\n")
    file.write("第二行内容\n")

3.2 `writelines(lines)`

将字符串列表写入文件，每个字符串对应一行。

注意：writelines() 不会自动添加换行符，需要手动在字符串中添加。

示例

lines = ["第一行内容\n", "第二行内容\n", "第三行内容\n"]

with open("example.txt", "w") as file:
    file.writelines(lines)

4. 追加模式

4.1 使用 `'a'` 追加内容

打开文件并在末尾追加内容，不会清除原内容。

示例

with open("example.txt", "a") as file:
    file.write("追加内容\n")

5. 文件指针操作

5.1 `tell()`

返回文件指针的当前位置（字节数）。

示例

with open("example.txt", "r") as file:
    print(file.tell())  # 输出：当前指针位置（初始为 0）
    file.read(5)        # 读取 5 个字符
    print(file.tell())  # 输出：指针当前位置（5）

5.2 `seek(offset, whence)`

移动文件指针 到指定位置。

参数：

offset：偏移量（以字节为单位）。

whence：相对位置（可选，默认为 0）。

0：文件开头。

1：当前位置。

2：文件末尾。

示例

with open("example.txt", "r") as file:
    file.seek(5)  # 移动到第 5 个字节
    print(file.read(5))  # 从第 5 个字节开始读取 5 个字符

6. 文件属性

6.1 `name`

返回文件的名称。

示例

with open("example.txt", "r") as file:
    print(file.name)  # 输出：example.txt

6.2 `mode`

返回文件的打开模式。

示例

with open("example.txt", "r") as file:
    print(file.mode)  # 输出：r

6.3 `closed`

检查文件是否已关闭。

示例

file = open("example.txt", "r")
print(file.closed)  # 输出：False
file.close()
print(file.closed)  # 输出：True

7. 二进制文件操作

7.1 `rb` 和 `wb` 模式

rb：以二进制模式读取文件。

wb：以二进制模式写入文件。

示例：读取二进制文件

with open("example.jpg", "rb") as file:
    content = file.read()
    print(content[:10])  # 输出文件的前 10 个字节

示例：写入二进制文件

with open("example.jpg", "rb") as source:
    with open("copy_example.jpg", "wb") as target:
        target.write(source.read())

8. 文件方法一览

方法	描述
`close()`	关闭文件。
`read(size)`	读取指定数量的字符，省略 `size` 则读取整个文件。
`readline()`	读取一行内容。
`readlines()`	读取所有行，返回一个字符串列表。
`write(string)`	将字符串写入文件。
`writelines(lines)`	将字符串列表写入文件。
`tell()`	返回文件指针的当前位置（字节数）。
`seek(offset, whence)`	移动文件指针到指定位置。
`flush()`	刷新文件缓冲区，将内容写入磁盘。

9. 文件缓冲区刷新

9.1 `flush()`

将文件缓冲区中的内容写入磁盘。

在写入数据后，不关闭文件的情况下强制保存到磁盘。

示例

with open("example.txt", "w") as file:
    file.write("写入数据\n")
    file.flush()  # 强制刷新缓冲区，立即写入磁盘

10. 文件的 `with` 语句

with 语句是推荐的文件操作方式，因为它可以自动管理文件的打开和关闭。

示例：使用 `with` 打开文件

with open("example.txt", "r") as file:
    content = file.read()
    print(content)
# 文件在 with 块结束后自动关闭

11. 总结

11.1 文件操作常用方法

方法	描述
`open()`	打开文件，返回文件对象。
`read()`	读取整个文件内容。
`readline()`	逐行读取文件内容（单行）。
`write()`	写入字符串到文件。
`tell()`	返回文件指针的当前位置。
`seek()`	移动文件指针到指定位置。
`writelines()`	写入字符串列表到文件。
`flush()`	刷新文件缓冲区，将内容写入磁盘。

11.2 文件打开模式

模式	描述
`'r'`	以只读模式打开（默认）。
`'w'`	以写入模式打开，清空文件内容。
`'a'`	以追加模式打开，将数据写入文件末尾。
`'b'`	以二进制模式打开（与其他模式组合使用）。
`'t'`	以文本模式打开（默认，与其他模式组合使用）。
`'+'`	以读写模式打开（与其他模式组合使用）。

通过熟练使用这些方法和模式，可以轻松完成日常文件处理任务，例如日志记录、数据存储、文件复制等。

以下是对 Python 文件操作方法 的进一步扩展，包括更多高级用法、性能优化技巧，以及常见问题的解决方案。

12. 文件方法的高级用法

12.1 `flush()` 的使用

flush() 方法强制将缓冲区中的数据写入磁盘，通常用于数据写入后立即保存。

常见场景：实时日志记录

import time

with open("log.txt", "w") as file:
    for i in range(5):
        file.write(f"Log entry {i + 1}\n")
        file.flush()  # 立即写入磁盘
        time.sleep(1)  # 模拟延迟

12.2 文件的上下文管理

with 语句是文件操作的最佳实践，能够避免忘记关闭文件的问题。它会自动调用文件的 __enter__ 和 __exit__ 方法，确保文件被正确关闭，即使出现异常。

示例：自动管理文件资源

with open("example.txt", "r") as file:
    content = file.read()
    print(content)
# 文件在退出 `with` 块时自动关闭

自定义上下文管理器

可以创建自定义的文件管理器类，模仿 with 的行为。

class FileManager:
    def __init__(self, filename, mode):
        self.file = open(filename, mode)
    
    def __enter__(self):
        return self.file  # 返回文件对象
    
    def __exit__(self, exc_type, exc_value, traceback):
        self.file.close()  # 确保文件关闭

with FileManager("example.txt", "r") as file:
    content = file.read()
    print(content)

12.3 文件缓冲区

文件 I/O 使用缓冲区来提高性能。Python 提供以下缓冲模式：

全缓冲（默认）：数据写入缓冲区，文件关闭或缓冲区满时写入磁盘。
行缓冲：适用于文本模式，\n 会触发缓冲区刷新。
无缓冲：数据立即写入磁盘（使用 buffering=0）。

示例：无缓冲模式

with open("example.txt", "w", buffering=0) as file:
    file.write("无缓冲写入数据")  # 立即写入磁盘

12.4 文件的 `truncate()` 方法

truncate(size) 方法用于截断文件到指定大小。如果未指定大小，则从当前位置截断到文件指针位置。

示例：截断文件

with open("example.txt", "w") as file:
    file.write("This is a long sentence.")
    file.truncate(10)  # 截断到前 10 个字符

结果文件内容：This is a

13. 文件操作的性能优化

文件操作可能涉及大量数据读写，以下是优化技巧：

13.1 分块读取文件

对于大文件，可以分块读取以降低内存占用。

示例：逐块读取

with open("large_file.txt", "r") as file:
    while True:
        chunk = file.read(1024)  # 每次读取 1024 字节
        if not chunk:
            break
        print(chunk)

13.2 使用多线程或多进程

对于大文件的处理，可以使用多线程或多进程加速。例如，统计文件字符数。

示例：多线程文件处理

import threading

def count_lines(filename):
    with open(filename, "r") as file:
        return len(file.readlines())

threads = []
files = ["file1.txt", "file2.txt", "file3.txt"]

for file in files:
    thread = threading.Thread(target=count_lines, args=(file,))
    threads.append(thread)
    thread.start()

for thread in threads:
    thread.join()

13.3 使用内存映射

mmap 模块允许将文件映射到内存，从而可以像操作字符串一样操作文件内容。

示例：内存映射文件

import mmap

with open("example.txt", "r+b") as file:
    mm = mmap.mmap(file.fileno(), 0)
    print(mm[:10])  # 读取前 10 个字节
    mm[0:5] = b"Hello"  # 修改文件内容
    mm.close()

13.4 使用生成器处理大文件

生成器的使用可以有效减少内存占用。

示例：逐行读取大文件

def read_large_file(file_path):
    with open(file_path, "r") as file:
        for line in file:
            yield line

for line in read_large_file("large_file.txt"):
    print(line.strip())

14. 常见问题及解决方案

14.1 如何防止文件不存在导致报错？

使用异常处理捕获 FileNotFoundError。

示例

try:
    with open("non_existent_file.txt", "r") as file:
        content = file.read()
except FileNotFoundError:
    print("文件未找到！")

14.2 如何检查文件是否已关闭？

使用文件对象的 closed 属性。

示例

file = open("example.txt", "r")
print(file.closed)  # 输出：False
file.close()
print(file.closed)  # 输出：True

14.3 如何处理编码问题？

当文件使用非 utf-8 编码时，需显式指定编码。

示例：读取 GBK 编码文件

with open("example.txt", "r", encoding="gbk") as file:
    content = file.read()
    print(content)

14.4 如何高效读取文件的最后几行？

可以使用 deque 数据结构读取文件的最后几行，而无需读取整个文件。

示例

from collections import deque

def tail(file_path, n):
    with open(file_path, "r") as file:
        return deque(file, maxlen=n)

last_lines = tail("example.txt", 5)  # 获取最后 5 行
print(''.join(last_lines))

14.5 如何安全地覆盖文件？

为了防止文件覆盖数据丢失，可以先写入临时文件，再重命名。

示例

import os

with open("temp_file.txt", "w") as temp_file:
    temp_file.write("新数据内容")

os.replace("temp_file.txt", "example.txt")  # 替换原文件

15. 文件操作的实际应用案例

15.1 合并多个文件内容

将多个文件的内容合并到一个文件中。

示例

def merge_files(output_file, *input_files):
    with open(output_file, "w") as outfile:
        for file in input_files:
            with open(file, "r") as infile:
                outfile.write(infile.read())

merge_files("merged.txt", "file1.txt", "file2.txt")

15.2 统计文件中的单词数量

示例

def count_words(file_path):
    with open(file_path, "r") as file:
        content = file.read()
        words = content.split()
        return len(words)

word_count = count_words("example.txt")
print(f"单词总数：{word_count}")

15.3 查找文件中的特定字符串

示例

def search_in_file(file_path, keyword):
    with open(file_path, "r") as file:
        for line_number, line in enumerate(file, start=1):
            if keyword in line:
                print(f"第 {line_number} 行找到：{line.strip()}")

search_in_file("example.txt", "Python")

16. 总结

16.1 文件方法概览

方法	描述
`read()`	读取文件内容，返回字符串。
`readline()`	读取一行内容。
`readlines()`	读取所有行，返回字符串列表。
`write()`	写入字符串到文件。
`writelines()`	写入字符串列表到文件。
`tell()`	返回文件指针的当前位置。
`seek()`	移动文件指针到指定位置。
`truncate()`	截断文件到指定大小。
`flush()`	刷新缓冲区，立即将数据写入磁盘。

16.2 文件操作的注意事项

使用 with 管理文件，确保文件正确关闭。
捕获常见异常（如 FileNotFoundError）。
对大文件使用分块读取或生成器。
注意文件编码问题，显式指定编码以避免乱码。
及时释放文件资源，避免文件锁定或占用。

通过熟练掌握 Python 的文件方法和操作技巧，可以高效处理各种文件任务，包括日志管理、数据存储、文件合并等复杂操作！

作者：小宝哥Code

物联沃分享整理
物联沃-IOTWORD物联网 » Python文件操作方法与技巧详解

代码收藏家普通

分享到：