问题:如何使用Python计算目录中的文件数
我需要计算使用Python的目录中的文件数。
我猜最简单的方法是len(glob.glob('*'))
,但这也将目录本身视为文件。
有什么方法可以只计算目录中的文件吗?
I need to count the number of files in a directory using Python.
I guess the easiest way is len(glob.glob('*'))
, but that also counts the directory itself as a file.
Is there any way to count only the files in a directory?
回答 0
os.listdir()
比使用会更有效率glob.glob
。要测试文件名是否是普通文件(而不是目录或其他实体),请使用os.path.isfile()
:
import os, os.path
# simple version for working with CWD
print len([name for name in os.listdir('.') if os.path.isfile(name)])
# path joining version for other paths
DIR = '/tmp'
print len([name for name in os.listdir(DIR) if os.path.isfile(os.path.join(DIR, name))])
os.listdir()
will be slightly more efficient than using glob.glob
. To test if a filename is an ordinary file (and not a directory or other entity), use os.path.isfile()
:
import os, os.path
# simple version for working with CWD
print len([name for name in os.listdir('.') if os.path.isfile(name)])
# path joining version for other paths
DIR = '/tmp'
print len([name for name in os.listdir(DIR) if os.path.isfile(os.path.join(DIR, name))])
回答 1
import os
path, dirs, files = next(os.walk("/usr/lib"))
file_count = len(files)
import os
path, dirs, files = next(os.walk("/usr/lib"))
file_count = len(files)
回答 2
对于所有类型的文件,都包含以下子目录:
import os
list = os.listdir(dir) # dir is your directory path
number_files = len(list)
print number_files
仅文件(避免使用子目录):
import os
onlyfiles = next(os.walk(dir))[2] #dir is your directory path as string
print len(onlyfiles)
For all kind of files, subdirectories included:
import os
list = os.listdir(dir) # dir is your directory path
number_files = len(list)
print number_files
Only files (avoiding subdirectories):
import os
onlyfiles = next(os.walk(dir))[2] #dir is your directory path as string
print len(onlyfiles)
回答 3
回答 4
如果要计算目录中的所有文件(包括子目录中的文件),则最Python化的方法是:
import os
file_count = sum(len(files) for _, _, files in os.walk(r'C:\Dropbox'))
print(file_count)
我们使用的总和要比显式添加文件计数(正在等待的时间)更快
If you want to count all files in the directory – including files in subdirectories, the most pythonic way is:
import os
file_count = sum(len(files) for _, _, files in os.walk(r'C:\Dropbox'))
print(file_count)
We use sum that is faster than explicitly adding the file counts (timings pending)
回答 5
import os
print len(os.listdir(os.getcwd()))
import os
print len(os.listdir(os.getcwd()))
回答 6
def directory(path,extension):
list_dir = []
list_dir = os.listdir(path)
count = 0
for file in list_dir:
if file.endswith(extension): # eg: '.txt'
count += 1
return count
def directory(path,extension):
list_dir = []
list_dir = os.listdir(path)
count = 0
for file in list_dir:
if file.endswith(extension): # eg: '.txt'
count += 1
return count
回答 7
我很惊讶没有人提到os.scandir
:
def count_files(dir):
return len([1 for x in list(os.scandir(dir)) if x.is_file()])
I am surprised that nobody mentioned os.scandir
:
def count_files(dir):
return len([1 for x in list(os.scandir(dir)) if x.is_file()])
回答 8
这可以使用os.listdir
并适用于任何目录:
import os
directory = 'mydirpath'
number_of_files = len([item for item in os.listdir(directory) if os.path.isfile(os.path.join(directory, item))])
可以使用生成器简化此过程,并通过以下方法使速度更快一点:
import os
isfile = os.path.isfile
join = os.path.join
directory = 'mydirpath'
number_of_files = sum(1 for item in os.listdir(directory) if isfile(join(directory, item)))
This uses os.listdir
and works for any directory:
import os
directory = 'mydirpath'
number_of_files = len([item for item in os.listdir(directory) if os.path.isfile(os.path.join(directory, item))])
this can be simplified with a generator and made a little bit faster with:
import os
isfile = os.path.isfile
join = os.path.join
directory = 'mydirpath'
number_of_files = sum(1 for item in os.listdir(directory) if isfile(join(directory, item)))
回答 9
def count_em(valid_path):
x = 0
for root, dirs, files in os.walk(valid_path):
for f in files:
x = x+1
print "There are", x, "files in this directory."
return x
取自这篇文章
def count_em(valid_path):
x = 0
for root, dirs, files in os.walk(valid_path):
for f in files:
x = x+1
print "There are", x, "files in this directory."
return x
Taked from this post
回答 10
import os
def count_files(in_directory):
joiner= (in_directory + os.path.sep).__add__
return sum(
os.path.isfile(filename)
for filename
in map(joiner, os.listdir(in_directory))
)
>>> count_files("/usr/lib")
1797
>>> len(os.listdir("/usr/lib"))
2049
import os
def count_files(in_directory):
joiner= (in_directory + os.path.sep).__add__
return sum(
os.path.isfile(filename)
for filename
in map(joiner, os.listdir(in_directory))
)
>>> count_files("/usr/lib")
1797
>>> len(os.listdir("/usr/lib"))
2049
回答 11
卢克的代码重新格式化。
import os
print len(os.walk('/usr/lib').next()[2])
Luke’s code reformat.
import os
print len(os.walk('/usr/lib').next()[2])
回答 12
这是我发现有用的简单单行命令:
print int(os.popen("ls | wc -l").read())
Here is a simple one-line command that I found useful:
print int(os.popen("ls | wc -l").read())
回答 13
虽然我同意@DanielStutzbach提供的答案:os.listdir()
将比使用效率更高glob.glob
。
但是,如果要计算文件夹中特定文件的数量,则需要额外的精度len(glob.glob())
。例如,如果您要计算要使用的文件夹中的所有pdf,请执行以下操作:
pdfCounter = len(glob.glob1(myPath,"*.pdf"))
While I agree with the answer provided by @DanielStutzbach: os.listdir()
will be slightly more efficient than using glob.glob
.
However, an extra precision, if you do want to count the number of specific files in folder, you want to use len(glob.glob())
. For instance if you were to count all the pdfs in a folder you want to use:
pdfCounter = len(glob.glob1(myPath,"*.pdf"))
回答 14
很简单:
print(len([iq for iq in os.scandir('PATH')]))
它只计算目录中的文件数,我使用列表推导技术遍历特定目录,以返回所有文件。“ len(返回列表)”返回文件数。
It is simple:
print(len([iq for iq in os.scandir('PATH')]))
it simply counts number of files in directory , i have used list comprehension technique to iterate through specific directory returning all files in return . “len(returned list)” returns number of files.
回答 15
import os
total_con=os.listdir('<directory path>')
files=[]
for f_n in total_con:
if os.path.isfile(f_n):
files.append(f_n)
print len(files)
import os
total_con=os.listdir('<directory path>')
files=[]
for f_n in total_con:
if os.path.isfile(f_n):
files.append(f_n)
print len(files)
回答 16
如果您将使用操作系统的标准外壳,则可以更快地获得结果,而不是使用纯pythonic方式。
Windows示例:
import os
import subprocess
def get_num_files(path):
cmd = 'DIR \"%s\" /A-D /B /S | FIND /C /V ""' % path
return int(subprocess.check_output(cmd, shell=True))
If you’ll be using the standard shell of the operating system, you can get the result much faster rather than using pure pythonic way.
Example for Windows:
import os
import subprocess
def get_num_files(path):
cmd = 'DIR \"%s\" /A-D /B /S | FIND /C /V ""' % path
return int(subprocess.check_output(cmd, shell=True))
回答 17
我找到了另一个可能是正确的答案。
for root, dirs, files in os.walk(input_path):
for name in files:
if os.path.splitext(name)[1] == '.TXT' or os.path.splitext(name)[1] == '.txt':
datafiles.append(os.path.join(root,name))
print len(files)
I found another answer which may be correct as accepted answer.
for root, dirs, files in os.walk(input_path):
for name in files:
if os.path.splitext(name)[1] == '.TXT' or os.path.splitext(name)[1] == '.txt':
datafiles.append(os.path.join(root,name))
print len(files)
回答 18
我用过glob.iglob
类似的目录结构
data
└───train
│ └───subfolder1
│ | │ file111.png
│ | │ file112.png
│ | │ ...
│ |
│ └───subfolder2
│ │ file121.png
│ │ file122.png
│ │ ...
└───test
│ file221.png
│ file222.png
以下两个选项均返回4(如预期,即不计算子文件夹本身)
len(list(glob.iglob("data/train/*/*.png", recursive=True)))
sum(1 for i in glob.iglob("data/train/*/*.png"))
I used glob.iglob
for a directory structure similar to
data
└───train
│ └───subfolder1
│ | │ file111.png
│ | │ file112.png
│ | │ ...
│ |
│ └───subfolder2
│ │ file121.png
│ │ file122.png
│ │ ...
└───test
│ file221.png
│ file222.png
Both of the following options return 4 (as expected, i.e. does not count the subfolders themselves)
len(list(glob.iglob("data/train/*/*.png", recursive=True)))
sum(1 for i in glob.iglob("data/train/*/*.png"))
回答 19
我这样做了,这返回了文件夹(Attack_Data)中文件的数量。
import os
def fcount(path):
#Counts the number of files in a directory
count = 0
for f in os.listdir(path):
if os.path.isfile(os.path.join(path, f)):
count += 1
return count
path = r"C:\Users\EE EKORO\Desktop\Attack_Data" #Read files in folder
print (fcount(path))
i did this and this returned the number of files in the folder(Attack_Data)…this works fine.
import os
def fcount(path):
#Counts the number of files in a directory
count = 0
for f in os.listdir(path):
if os.path.isfile(os.path.join(path, f)):
count += 1
return count
path = r"C:\Users\EE EKORO\Desktop\Attack_Data" #Read files in folder
print (fcount(path))