Concat MultiIndex pandas DataFrame columns

Problem description:

I have a grouped pandas DataFrame with a MultiIndex, similar to the following:

In [10]: arrays = [np.array(['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux']), 
    ....:   np.array(['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two'])] 
    ....: 

In [11]: s = pd.Series(np.random.randn(8), index=arrays) 

In [12]: s 
Out[12]: 
bar one -0.861849 
    two -2.104569 
baz one -0.494929 
    two 1.071804 
foo one 0.721555 
    two -0.706771 
qux one -1.039575 
    two 0.271860 

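How can I concat the values of the first column onto the second column? This is harder than "How to concat Pandas dataframe columns" because it involves multi-level data / hierarchical indexing / MultiIndex.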

UPDATE:

My real data actually comes from a database, with proper names. The trick still doesn't work on my end:

p['Details']= p.index.to_series().str.join(' ') + ' ' + p.astype(str) 
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\ops.py", line 995, in f
    return self._combine_series(other, na_op, fill_value, axis, level)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\frame.py", line 3446, in _combine_series
    return self._combine_series_infer(other, func, level=level, fill_value=fill_value)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\frame.py", line 3457, in _combine_series_infer
    return self._combine_match_columns(other, func, level=level, fill_value=fill_value)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\frame.py", line 3469, in _combine_match_columns
    left, right = self.align(other, join='outer', axis=1, level=level, copy=False)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\frame.py", line 2679, in align
    fill_axis=fill_axis, broadcast_axis=broadcast_axis)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\generic.py", line 3784, in align
    fill_axis=fill_axis)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\generic.py", line 3865, in _align_series
    return_indexers=True)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\index.py", line 2233, in join
    return self._join_multi(other, how=how, return_indexers=return_indexers)
  File "D:\Programs\Anaconda3\lib\site-packages\pandas\core\index.py", line 2326, in _join_multi
    raise ValueError("cannot join with no level specified and no overlapping names")
ValueError: cannot join with no level specified and no overlapping names

Heading home now. Will follow up tomorrow.

Thanks

Those first two columns are actually the index of a Series object.

s.index.to_series().str.join(' ') + ' ' + s.astype(str) 

This gives you:


bar one  bar one -1.29416824528 
    two bar two -0.417249293315 
baz one baz one -0.474058653156 
    two baz two -0.941660942375 
foo one  foo one -0.41741715261 
    two  foo two 0.739981512301 
qux one  qux one -1.03909641549 
    two  qux two -1.00168469914 
dtype: object 
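
For anyone who wants to try this end to end, here is a minimal self-contained sketch of the same idea. It swaps the `.str.join` call for a plain list comprehension over the index tuples (my substitution, not the one-liner above), and the sample values are made up so the result is predictable.

```python
import numpy as np
import pandas as pd

# Made-up sample data shaped like the Series in the question.
arrays = [
    np.array(['bar', 'bar', 'baz', 'baz']),
    np.array(['one', 'two', 'one', 'two']),
]
s = pd.Series([1.5, -2.0, 0.25, 3.0], index=arrays)

# Iterating a MultiIndex yields tuples, e.g. ('bar', 'one') -> 'bar one'.
labels = pd.Series([' '.join(t) for t in s.index], index=s.index)

# Append the stringified values; the result keeps the original MultiIndex.
combined = labels + ' ' + s.astype(str)
print(combined)
```

Both operands share the same index, so the addition lines up row by row.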

Or perhaps you want to keep the float values unchanged and just collapse the MultiIndex. This has been answered here:

s.index = s.index.to_series().str.join(' ') 

bar one -1.294168 
bar two -0.417249 
baz one -0.474059 
baz two -0.941661 
foo one -0.417417 
foo two 0.739982 
qux one -1.039096 
qux two -1.001685 
dtype: float64 
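
Regarding the UPDATE in the question: the traceback runs through frame.py, which suggests `p` is a DataFrame rather than a Series. Adding a Series to a DataFrame aligns the Series index against the DataFrame's columns, and that is probably where the join fails. A rough sketch of one way around it, assuming a hypothetical single value column called `value` (the real column names from the database are not shown in the question):

```python
import pandas as pd

# Hypothetical stand-in for the database result: a DataFrame, not a Series.
index = pd.MultiIndex.from_tuples(
    [('bar', 'one'), ('bar', 'two'), ('baz', 'one')],
    names=['first', 'second'],  # assumed level names
)
p = pd.DataFrame({'value': [1.5, -2.0, 0.25]}, index=index)

# Build the joined labels from the index tuples.
labels = pd.Series([' '.join(t) for t in p.index], index=p.index)

# Select one column so that both operands are Series sharing the same index.
p['Details'] = labels + ' ' + p['value'].astype(str)
print(p)
```

Picking a single column with `p['value']` keeps the whole operation Series-to-Series, so no column alignment is attempted.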

Oh, yes. The second trick works. Thanks! – xpt


np, go home, win! :-) – piRSquared


The newly joined MultiIndex has no column header. How can I give the newly joined MultiIndex a column header (name)? Thx – xpt
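
One possible way to do that, as a sketch: the collapsed index is an ordinary Index, so it can be given a name when it is built. The name `Details` is borrowed from the question's UPDATE and `value` is a made-up column name; both are only illustrative.

```python
import pandas as pd

# Made-up sample data with a two-level index.
index = pd.MultiIndex.from_tuples(
    [('bar', 'one'), ('bar', 'two'), ('baz', 'one')]
)
s = pd.Series([1.5, -2.0, 0.25], index=index)

# Collapse the MultiIndex into a flat Index and name it in one step.
s.index = pd.Index([' '.join(t) for t in s.index], name='Details')

# The name becomes the column header once the index is turned into a column.
df = s.reset_index(name='value')
print(df)
```

If the index has already been collapsed, assigning `s.index.name = 'Details'` or calling `s.rename_axis('Details')` should have the same effect.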