Python 实用宝典

Question: Updating the index after sorting a data-frame

Take the following data-frame:

import numpy as np
import pandas as pd

x = np.tile(np.arange(3),3)
y = np.repeat(np.arange(3),3)
df = pd.DataFrame({"x": x, "y": y})

I need to sort it by x first, and only second by y:

df2 = df.sort(["x", "y"])

How can I change the index so that it is ascending again? I.e., how do I get this:

   x  y
0  0  0
1  0  1
2  0  2
3  1  0
4  1  1
5  1  2
6  2  0
7  2  1
8  2  2

I have tried the following. Unfortunately, it doesn’t change the index at all:

df2.reindex(np.arange(len(df2.index)))

Answer 0

You can reset the index using reset_index to get back a default index of 0, 1, 2, …, n-1 (and use drop=True to indicate you want to drop the existing index instead of adding it as an additional column to your dataframe):

In [19]: df2 = df2.reset_index(drop=True)

In [20]: df2
Out[20]:
   x  y
0  0  0
1  0  1
2  0  2
3  1  0
4  1  1
5  1  2
6  2  0
7  2  1
8  2  2
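
To make the difference between reindex and reset_index concrete, here is a minimal sketch built on the question's data. It assumes a pandas version where sort_values is available (used here in place of the old df.sort), and the names restored, relabeled and kept are only illustrative. reindex looks rows up by their existing labels, so asking for labels 0..n-1 puts the rows back in their original order, while reset_index keeps the sorted order and only replaces the labels.

```python
import numpy as np
import pandas as pd

x = np.tile(np.arange(3), 3)
y = np.repeat(np.arange(3), 3)
df = pd.DataFrame({"x": x, "y": y})

# Sorting moves the rows, and each row keeps its original label.
df2 = df.sort_values(["x", "y"])
print(df2.index.tolist())  # [0, 3, 6, 1, 4, 7, 2, 5, 8]

# reindex selects rows by label, so this returns the rows in the
# original (unsorted) order instead of relabeling the sorted rows.
restored = df2.reindex(np.arange(len(df2.index)))

# reset_index(drop=True) keeps the sorted row order and assigns 0..n-1.
relabeled = df2.reset_index(drop=True)

# Without drop=True the old labels are kept as a column named "index".
kept = df2.reset_index()
print(kept.columns.tolist())  # ['index', 'x', 'y']
```

Both reindex and reset_index return a new data-frame, which is why the result has to be assigned.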

Answer 1

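df.sort() is deprecated (and has been removed from later pandas releases); use df.sort_values(...) instead: https://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.sort_values.html

Then follow joris’ answer and call df.reset_index(drop=True), assigning the result, since it returns a new data-frame.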
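
Put together, the two steps might look like the sketch below. It reuses the question's data; the name sorted_df is just an illustrative choice. Neither call modifies df in place, so the calls are chained and the result is assigned.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "x": np.tile(np.arange(3), 3),
    "y": np.repeat(np.arange(3), 3),
})

# Sort by x, then y, and replace the shuffled labels with 0..n-1.
sorted_df = df.sort_values(by=["x", "y"]).reset_index(drop=True)

print(sorted_df.index.tolist())  # [0, 1, 2, 3, 4, 5, 6, 7, 8]

# The original frame is left untouched, in its original row order.
print(df["x"].tolist())  # [0, 1, 2, 0, 1, 2, 0, 1, 2]
```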

Answer 2

Since pandas 1.0.0, df.sort_values has a new parameter, ignore_index, which does exactly what you need:

In [1]: df2 = df.sort_values(by=['x','y'],ignore_index=True)

In [2]: df2
Out[2]:
   x  y
0  0  0
1  0  1
2  0  2
3  1  0
4  1  1
5  1  2
6  2  0
7  2  1
8  2  2
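
As a quick check that ignore_index=True matches the sort followed by reset_index(drop=True), a small comparison can be sketched as follows. It assumes pandas 1.0.0 or newer; one_step, two_step and same are illustrative names.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "x": np.tile(np.arange(3), 3),
    "y": np.repeat(np.arange(3), 3),
})

# One call: sort and relabel 0..n-1 (needs pandas >= 1.0.0).
one_step = df.sort_values(by=["x", "y"], ignore_index=True)

# Two calls: sort, then drop the old labels.
two_step = df.sort_values(by=["x", "y"]).reset_index(drop=True)

# equals compares values and index labels of the two frames.
same = one_step.equals(two_step)
print(same)
```

On older pandas versions the ignore_index keyword is not accepted, so the two-call form is the one to fall back on.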

Answer 3

You can set new indices by using set_index:

df2.set_index(np.arange(len(df2.index)))

Output:

   x  y
0  0  0
1  0  1
2  0  2
3  1  0
4  1  1
5  1  2
6  2  0
7  2  1
8  2  2
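
One thing worth spelling out is that set_index also returns a new data-frame rather than changing df2. The sketch below shows this on the question's data, with sort_values standing in for the old df.sort. It also shows direct assignment to the index attribute as an in-place alternative; that variant is an addition here, not part of the answer, and by_set_index and in_place are illustrative names.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "x": np.tile(np.arange(3), 3),
    "y": np.repeat(np.arange(3), 3),
})
df2 = df.sort_values(by=["x", "y"])

# set_index accepts an array of the same length as the frame and
# returns a new frame that uses it as the index.
by_set_index = df2.set_index(np.arange(len(df2.index)))
print(by_set_index.index.tolist())  # [0, 1, 2, 3, 4, 5, 6, 7, 8]

# df2 itself still carries the shuffled labels.
print(df2.index.tolist())  # [0, 3, 6, 1, 4, 7, 2, 5, 8]

# Assigning to .index changes the labels in place (done on a copy here
# so that df2 stays available for comparison).
in_place = df2.copy()
in_place.index = range(len(in_place))
```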
