Python for identifying minimal chromosomal regions among samples



I have multiple sample files (>20) that look like:

chr startpos endpos
1 14930 818094
1 818161 31595422
2 35593931 35865807
2 35868158 104785784

And I would like to output regions that are common among samples. E.g. if sample 1 has:

1 14900 818000

sample 2:

1 15000 605000

sample 3:

1 25000 705000

I would like to output:

1 25000 605000

I would also like to include a majority rule: for example, if 10 out of 20 samples share a minimal region, output that region. In other words, the number of samples that must share a region for it to be printed should be configurable.

Does anyone have a Python solution for this?
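For illustration, the majority rule described above can be read as a coverage count: walk along each chromosome and report the stretches where at least a chosen number of samples have a region. The following is only a minimal sketch under that reading, not a tested bioinformatics tool. The function name `regions_shared_by` and the parameter `min_samples` are hypothetical, and the sketch assumes that regions within a single sample do not overlap each other and that end positions are inclusive.

```python
from collections import defaultdict


def regions_shared_by(samples, min_samples):
    """Return (chrom, start, end) stretches covered by >= min_samples samples.

    samples: list with one entry per sample, each a list of
             (chrom, start, end) tuples.
    Assumes regions inside one sample never overlap, so the running
    depth equals the number of samples covering a position.
    """
    events = defaultdict(list)
    for sample in samples:
        for chrom, start, end in sample:
            events[chrom].append((start, 1))   # a region opens
            events[chrom].append((end, -1))    # a region closes

    result = []
    for chrom in sorted(events):
        depth = 0
        open_at = None
        # Sort by position; at equal positions handle openings first,
        # so regions that merely touch share that single position.
        for pos, delta in sorted(events[chrom], key=lambda e: (e[0], -e[1])):
            before = depth
            depth += delta
            if before < min_samples <= depth:
                open_at = pos                      # threshold reached
            elif depth < min_samples <= before:
                result.append((chrom, open_at, pos))  # dropped below threshold
    return result


# The three example rows from the question, one region per sample.
samples = [
    [("1", 14900, 818000)],
    [("1", 15000, 605000)],
    [("1", 25000, 705000)],
]

all_three = regions_shared_by(samples, min_samples=3)
at_least_two = regions_shared_by(samples, min_samples=2)
print(all_three)
print(at_least_two)
```

With `min_samples` equal to the number of samples this reduces to the plain intersection (1 25000 605000 for the example); lowering it relaxes the requirement.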

asked 12 hours ago

lindak


This question is not really about Unix/Linux, but about programming (coding, algorithms) so it's more appropriate for Stack Overflow rather than this site.

– filbranden
10 hours ago


Also note that people at Stack Exchange will typically not want to do your work/homework for you. These are volunteers here, who are happy to help, but you need to show you're making an effort too. So try to solve this on your own and, when you get stumped, ask a specific question about what is happening that is unexpected. You're more likely to get useful answers (and to learn!) that way.

– filbranden
10 hours ago



python bioinformatics


1 Answer

Not sure whether this is a question for the Unix & Linux Stack Exchange. It sounds more like a general programming question.

However, I'd encourage you to look into using pandas.

You can import your sample file as a dataframe, specifying a tab delimiter as follows:

import pandas as pd
df = pd.read_csv('/tmp/samplefile.csv', sep='\t')

If you know that startpos will always be smaller than endpos, then once the overlapping rows from all samples are in one dataframe, you could find the output you're looking for by taking the maximum of df['startpos'] and the minimum of df['endpos'].
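As a rough sketch of that max/min idea, the snippet below stacks the three example rows from the question into one dataframe and takes the largest startpos and smallest endpos per chromosome. The in-memory strings stand in for real sample files, and the names `samples`, `regions` and `common_region` are made up for illustration. It assumes each sample contributes exactly one region per chromosome and that all of those regions overlap; with several regions per chromosome this simple aggregation would not be enough.

```python
import io

import pandas as pd

# Hypothetical stand-ins for three tab-separated sample files;
# real code would pass file paths to pd.read_csv instead.
samples = {
    "sample1": "chr\tstartpos\tendpos\n1\t14900\t818000\n",
    "sample2": "chr\tstartpos\tendpos\n1\t15000\t605000\n",
    "sample3": "chr\tstartpos\tendpos\n1\t25000\t705000\n",
}

frames = []
for name, text in samples.items():
    df = pd.read_csv(io.StringIO(text), sep="\t")
    df["sample"] = name  # remember which sample each row came from
    frames.append(df)
regions = pd.concat(frames, ignore_index=True)


def common_region(regions):
    """Per chromosome: largest start and smallest end over all rows."""
    out = (
        regions.groupby("chr")
        .agg(startpos=("startpos", "max"), endpos=("endpos", "min"))
        .reset_index()
    )
    # Drop chromosomes where the regions do not actually intersect.
    return out[out["startpos"] <= out["endpos"]]


common = common_region(regions)
print(common.to_string(index=False))
```

For the example rows this gives chromosome 1 from 25000 to 605000, matching the output requested in the question.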

edited 9 hours ago

answered 10 hours ago

mttpgn

