Pandas select on multiple columns then replace

up vote
2
down vote

favorite

I am trying to do a multiple column select then replace in pandas

df:

a  b  c  d  e

0  1  1  0  none

0  0  0  1  none

1  0  0  0  none

0  0  0  0  none

select where any or all of a, b, c, d are non zero

i, j = np.where(df)

s=pd.Series(dict(zip(zip(i, j), 

  df.columns[j]))).reset_index(-1, drop=True)

Now I want to replace the values in column e by the series:

df['e'] = s.values

so that e looks like:

b, c 

d

a

none

But the problem is that the lengths of the series are different to the number of rows in the dataframe.

Any idea on how I can do this?

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

1

Your commend code worked perfectly. I couldn't get the 'duplicate' answer to work. So from that perspective isnt a 100% duplicate
– proximacentauri
Nov 11 at 4:50

add a comment |

up vote
2
down vote

favorite

I am trying to do a multiple column select then replace in pandas

df:

a  b  c  d  e

0  1  1  0  none

0  0  0  1  none

1  0  0  0  none

0  0  0  0  none

select where any or all of a, b, c, d are non zero

i, j = np.where(df)

s=pd.Series(dict(zip(zip(i, j), 

  df.columns[j]))).reset_index(-1, drop=True)

Now I want to replace the values in column e by the series:

df['e'] = s.values

so that e looks like:

b, c 

d

a

none

But the problem is that the lengths of the series are different to the number of rows in the dataframe.

Any idea on how I can do this?

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

1

Your commend code worked perfectly. I couldn't get the 'duplicate' answer to work. So from that perspective isnt a 100% duplicate
– proximacentauri
Nov 11 at 4:50

add a comment |

up vote
2
down vote

favorite

I am trying to do a multiple column select then replace in pandas

df:

a  b  c  d  e

0  1  1  0  none

0  0  0  1  none

1  0  0  0  none

0  0  0  0  none

select where any or all of a, b, c, d are non zero

i, j = np.where(df)

s=pd.Series(dict(zip(zip(i, j), 

  df.columns[j]))).reset_index(-1, drop=True)

Now I want to replace the values in column e by the series:

df['e'] = s.values

so that e looks like:

b, c 

d

a

none

But the problem is that the lengths of the series are different to the number of rows in the dataframe.

Any idea on how I can do this?

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

I am trying to do a multiple column select then replace in pandas

df:

a  b  c  d  e

0  1  1  0  none

0  0  0  1  none

1  0  0  0  none

0  0  0  0  none

select where any or all of a, b, c, d are non zero

i, j = np.where(df)

s=pd.Series(dict(zip(zip(i, j), 

  df.columns[j]))).reset_index(-1, drop=True)

Now I want to replace the values in column e by the series:

df['e'] = s.values

so that e looks like:

b, c 

d

a

none

But the problem is that the lengths of the series are different to the number of rows in the dataframe.

Any idea on how I can do this?

python pandas

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

edited Nov 11 at 4:41

asked Nov 11 at 4:40

proximacentauri

5271420

asked Nov 11 at 4:40

proximacentauri

5271420

asked Nov 11 at 4:40

proximacentauri

5271420

1

Your commend code worked perfectly. I couldn't get the 'duplicate' answer to work. So from that perspective isnt a 100% duplicate
– proximacentauri
Nov 11 at 4:50

add a comment |

1

Your commend code worked perfectly. I couldn't get the 'duplicate' answer to work. So from that perspective isnt a 100% duplicate
– proximacentauri
Nov 11 at 4:50

Your commend code worked perfectly. I couldn't get the 'duplicate' answer to work. So from that perspective isnt a 100% duplicate
– proximacentauri
Nov 11 at 4:50

add a comment |

2 Answers
2

active

oldest

votes

up vote
2
down vote

accepted

Use DataFrame.dot for product with columns names, add rstrip, last add numpy.where for replace empty strings to None:

e = df.dot(df.columns + ', ').str.rstrip(', ')

df['e'] = np.where(e.astype(bool), e, None)

print (df)

   a  b  c  d     e

0  0  1  1  0  b, c

1  0  0  0  1     d

2  1  0  0  0     a

3  0  0  0  0  None

answered Nov 11 at 4:49

jezrael

310k21246321

add a comment |

up vote
2
down vote

You can locate the 1's and use their locations as boolean indexes into the dataframe columns:

df['e'] = (df==1).apply(lambda x: df.columns[x], axis=1)

                 .str.join(",").replace('','none')

#   a  b  c  d     e

#0  0  1  1  0   b,c

#1  0  0  0  1     d

#2  1  0  0  0     a

#3  0  0  0  0  none

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53245903%2fpandas-select-on-multiple-columns-then-replace%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

up vote
2
down vote

accepted

Use DataFrame.dot for product with columns names, add rstrip, last add numpy.where for replace empty strings to None:

e = df.dot(df.columns + ', ').str.rstrip(', ')

df['e'] = np.where(e.astype(bool), e, None)

print (df)

   a  b  c  d     e

0  0  1  1  0  b, c

1  0  0  0  1     d

2  1  0  0  0     a

3  0  0  0  0  None

answered Nov 11 at 4:49

jezrael

310k21246321

add a comment |

up vote
2
down vote

accepted

Use DataFrame.dot for product with columns names, add rstrip, last add numpy.where for replace empty strings to None:

e = df.dot(df.columns + ', ').str.rstrip(', ')

df['e'] = np.where(e.astype(bool), e, None)

print (df)

   a  b  c  d     e

0  0  1  1  0  b, c

1  0  0  0  1     d

2  1  0  0  0     a

3  0  0  0  0  None

answered Nov 11 at 4:49

jezrael

310k21246321

add a comment |

up vote
2
down vote

accepted

Use DataFrame.dot for product with columns names, add rstrip, last add numpy.where for replace empty strings to None:

e = df.dot(df.columns + ', ').str.rstrip(', ')

df['e'] = np.where(e.astype(bool), e, None)

print (df)

   a  b  c  d     e

0  0  1  1  0  b, c

1  0  0  0  1     d

2  1  0  0  0     a

3  0  0  0  0  None

answered Nov 11 at 4:49

jezrael

310k21246321

Use DataFrame.dot for product with columns names, add rstrip, last add numpy.where for replace empty strings to None:

e = df.dot(df.columns + ', ').str.rstrip(', ')

df['e'] = np.where(e.astype(bool), e, None)

print (df)

   a  b  c  d     e

0  0  1  1  0  b, c

1  0  0  0  1     d

2  1  0  0  0     a

3  0  0  0  0  None

answered Nov 11 at 4:49

jezrael

310k21246321

answered Nov 11 at 4:49

jezrael

310k21246321

answered Nov 11 at 4:49

jezrael

310k21246321

answered Nov 11 at 4:49

jezrael

310k21246321

add a comment |

up vote
2
down vote

You can locate the 1's and use their locations as boolean indexes into the dataframe columns:

df['e'] = (df==1).apply(lambda x: df.columns[x], axis=1)

                 .str.join(",").replace('','none')

#   a  b  c  d     e

#0  0  1  1  0   b,c

#1  0  0  0  1     d

#2  1  0  0  0     a

#3  0  0  0  0  none

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

add a comment |

up vote
2
down vote

You can locate the 1's and use their locations as boolean indexes into the dataframe columns:

df['e'] = (df==1).apply(lambda x: df.columns[x], axis=1)

                 .str.join(",").replace('','none')

#   a  b  c  d     e

#0  0  1  1  0   b,c

#1  0  0  0  1     d

#2  1  0  0  0     a

#3  0  0  0  0  none

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

add a comment |

up vote
2
down vote

You can locate the 1's and use their locations as boolean indexes into the dataframe columns:

df['e'] = (df==1).apply(lambda x: df.columns[x], axis=1)

                 .str.join(",").replace('','none')

#   a  b  c  d     e

#0  0  1  1  0   b,c

#1  0  0  0  1     d

#2  1  0  0  0     a

#3  0  0  0  0  none

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

You can locate the 1's and use their locations as boolean indexes into the dataframe columns:

df['e'] = (df==1).apply(lambda x: df.columns[x], axis=1)

                 .str.join(",").replace('','none')

#   a  b  c  d     e

#0  0  1  1  0   b,c

#1  0  0  0  1     d

#2  1  0  0  0     a

#3  0  0  0  0  none

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

edited Nov 11 at 4:55

answered Nov 11 at 4:55

DYZ

24.3k61948

answered Nov 11 at 4:55

DYZ

24.3k61948

answered Nov 11 at 4:55

DYZ

24.3k61948

add a comment |

draft saved

draft discarded

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Nrthugu