Skip to content

Handling GRNboost output #40

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
dpcook opened this issue Aug 12, 2018 · 23 comments
Open

Handling GRNboost output #40

dpcook opened this issue Aug 12, 2018 · 23 comments
Assignees
Labels

Comments

@dpcook
Copy link

dpcook commented Aug 12, 2018

Hi there. I've been going through the vignettes and I can't seem to figure out how to get scenicOptions to point to the tsv output file from GRNboost (or a data frame of the results after importing it into R).

Any advice on how to use GRNboost results in the SCENIC pipeline would be greatly appreciated!

@JBreunig
Copy link

JBreunig commented Aug 17, 2018

I think that we are waiting for this function to be implemented:
https://github.com/aertslab/SCENIC/blob/master/man/importGRNBoostResults.Rd

Also referred to here:
https://htmlpreview.github.io/?https://github.com/aertslab/SCENIC/blob/master/inst/doc/SCENIC_Running.html#option-b-grnboost-python

We are now ready to run GRNBoost. You can find the tutorial with some examples in https://arboreto.readthedocs.io/en/latest/examples.html. Once it has finished running, you can import the output from GRNBoost back to R with:

importGRNBoostResults() # TODO (Not implemented yet!)

I've found SCENIC tremendously useful but I look forward to being able to skipping the GENIE3 step as it is very time-consuming compared with what we've seen with GRNBoost.

@jpezoldt
Copy link

What worked for me was to rename the colnames of the GRNBoost output .tsv required by the downstream wrapper runSCENIC_1_coexNetwork2modules() and save the file in the respective scenic int/ directory.

e.g.:
GRNBoost_output <- read.delim("~dir_to_GRNboost_tsv_file/ex_5000SC_network.tsv", header=FALSE)
colnames(GRNBoost_output) <- c("TF","Target","weight")
saveRDS(GRNBoost_output, file="int/1.4_GENIE3_linkList.Rds")

@JBreunig
Copy link

Thanks for the suggestion! I tried that and ended up with the following error:

runSCENIC_1_coexNetwork2modules(scenicOptions)
17:14 Creating TF modules
Error in quantile.default(linkList$weight, probs = c(0.75, 0.9)) :
factors are not allowed

It seems like the GENIE3 weights are <1 while the pySCENIC weights/importance range from .02-160

@jpezoldt
Copy link

Indeed GRNboost has a different range for the scores as compared to GENIE3. But so far I have not observed an impact for my analysis and it also should not have one.

It seems like the GENIE3 weights are <1 while the pySCENIC weights/importance range from .02-160

I checked, whether having the class factor in the GRNBoost_output table will give me the same error message. Indeed, this is the case and if the "weight" column is class() factor I get the identical error message. You could check your input table #columns for the class() and convert the weight column to numeric.
Hope this helps.

@s-aibar s-aibar added the TO DO label Jan 28, 2019
@s-aibar s-aibar self-assigned this Jan 28, 2019
@KabitaBaral1
Copy link

Hi @dpcook @JBreunig @jpezoldt , My dataset is too big for GENIE3 and I have been trying to run GRNboost using the vignette as suggested in
https://arboreto.readthedocs.io/en/latest/examples.html

However, I feel like I have missed something because my code does not seem to run. Could you please help me with which output from SCENIC do you input into GRNboost ?

1.1_genesKept.Rds 1.2_corrMat.Rds cellInfo.Rds colVars.Rds or scenicOptions.Rds?

Thank you

@JBreunig
Copy link

JBreunig commented Apr 1, 2020

I moved on to a scanpy-->pyscenic workflow. I don't believe that I ever got things to work going from pySCENIC to SCENIC but that was before the tutorials were written.

@KabitaBaral1
Copy link

Hi @JBreunig Thank you.

I was wondering if you used Jupyter Notebook or CLI for grn step of pyscenic?
I am using Jupyter Notebook and I keep running into weird errors in this step:

!pyscenic grn {f_loom_path_scenic} {f_tfs} -o adj.csv --num_workers 20

I am not sure if it is version compatibility issues with dependencies and dask or something else.
I followed the following protocol:
https://github.com/aertslab/SCENICprotocol/blob/master/notebooks/PBMC10k_SCENIC-protocol-CLI.ipynb

Thank you

@JBreunig
Copy link

JBreunig commented Apr 10, 2020

I use spyder. But for that particular step, I just go back to the CLI and run it as a typical linux command (from the appropriate directory).

pyscenic grn matrix.csv mm_mgi_tfs.txt -o matrix.adjacencies.tsv --num_workers 15

@KabitaBaral1
Copy link

Thank you @JBreunig Just one more question,
which version of dask, distributed, pandas, msgpack and tornado do you use?
I seem to get errors associated with version incompatibility even when I run it via commandline.

especially this for tornado:
tornado.iostream.StreamClosedError: Stream is closed

Thanks

@JBreunig
Copy link

It looks like:
dask - 1.0.0, distributed - 1.28.1, pandas - 0.25.3, msgpack - 0.6.1 and tornado 6.0.3

@KabitaBaral1
Copy link

Thank you very much @JBreunig.

Hopefully it works. Thanks again

@KabitaBaral1
Copy link

Hi @JBreunig Sorry to bug again,
I tried running my code in linux and I get this very long error. Did you ever run into something like this?
Could you please help me figure this out?

Thank you

/home/kbaral/anaconda3/lib/python3.7/site-packages/dask/config.py:161: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details.
data = yaml.load(f.read()) or {}

2020-04-10 19:58:06,483 - pyscenic.cli.pyscenic - INFO - Loading expression matrix.

2020-04-10 19:58:12,336 - pyscenic.cli.pyscenic - INFO - Inferring regulatory networks.
/home/kbaral/anaconda3/lib/python3.7/site-packages/dask/config.py:161: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details.
data = yaml.load(f.read()) or {}
preparing dask client
parsing input
/home/kbaral/anaconda3/lib/python3.7/site-packages/arboreto/algo.py:214: FutureWarning: Method .as_matrix will be removed in a future version. Use .values instead.
expression_matrix = expression_data.as_matrix()
creating dask graph
20 partitions
computing dask graph
'infer_data failed for target KIAA2013' Retry (1/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (1/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (1/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (1/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (2/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (2/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (2/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (2/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (1/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (1/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (1/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (1/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (3/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (3/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (1/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (3/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (1/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (3/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (1/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (2/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (2/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (1/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (2/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (2/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (4/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (4/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (1/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (1/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (1/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (3/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (4/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (2/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (3/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (2/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (3/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (2/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (3/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (4/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (5/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (2/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (5/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (1/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (2/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (2/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (1/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (1/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (4/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (4/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (3/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (3/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (5/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (2/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (4/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (6/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (3/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (2/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (3/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (5/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (3/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (6/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (4/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (2/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (3/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (2/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (5/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (5/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (4/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (4/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (3/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (7/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (4/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (3/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (6/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (4/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (3/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (5/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (5/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (4/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (6/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (4/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (6/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (7/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (3/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (5/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (6/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (5/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (5/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (4/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (4/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (5/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (4/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (6/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (7/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (5/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (7/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (8/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (6/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (7/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (4/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (6/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (5/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (6/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (5/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (5/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (7/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (6/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (5/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (8/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (8/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (6/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (7/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (6/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (8/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (7/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (8/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (7/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (5/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (9/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (6/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (7/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (6/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (9/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (6/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (8/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (7/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (7/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (9/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092807.2' Retry (9/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (7/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (6/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (8/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (6/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (8/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC103591.3' Retry (10/10). Failure caused by ValueError("Regression for target gene AC103591.3 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target KIAA2013' Retry (9/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (8/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (7/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (7/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC092343.1' Retry (10/10). Failure caused by ValueError("Regression for target gene AC092343.1 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target AGTRAP' Retry (9/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (8/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ODF2L' Retry (10/10). Failure caused by ValueError("Regression for target gene ODF2L failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target AC092807.2' Retry (10/10). Failure caused by ValueError("Regression for target gene AC092807.2 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target PIK3CD' Retry (8/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (8/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (8/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (7/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (9/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target GIPC2' Retry (9/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (7/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (1/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (8/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (7/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (1/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AGTRAP' Retry (10/10). Failure caused by ValueError("Regression for target gene AGTRAP failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target FBXO6' Retry (9/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target B4GALT3' Retry (1/10). Failure caused by ValueError("Regression for target gene B4GALT3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (8/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target KIAA2013' Retry (10/10). Failure caused by ValueError("Regression for target gene KIAA2013 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa1e0>, (<function infer_partial_network at 0x7f22387e22f0>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target PI4KB' Retry (1/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (9/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CSMD2' Retry (10/10). Failure caused by ValueError("Regression for target gene CSMD2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PIK3CD' Retry (9/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target GIPC2' Retry (10/10). Failure caused by ValueError("Regression for target gene GIPC2 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target TMCO4' Retry (8/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (2/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AC093158.1' Retry (9/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (8/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (8/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target MRPS14' Retry (1/10). Failure caused by ValueError("Regression for target gene MRPS14 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (2/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (9/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (9/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PI4KB' Retry (2/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target LRRC38' Retry (10/10). Failure caused by ValueError("Regression for target gene LRRC38 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target PIK3CD' Retry (10/10). Failure caused by ValueError("Regression for target gene PIK3CD failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target B4GALT3' Retry (2/10). Failure caused by ValueError("Regression for target gene B4GALT3 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target TRMT1L' Retry (1/10). Failure caused by ValueError("Regression for target gene TRMT1L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (9/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DUSP12' Retry (1/10). Failure caused by ValueError("Regression for target gene DUSP12 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (3/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (9/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (9/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target FBXO6' Retry (10/10). Failure caused by ValueError("Regression for target gene FBXO6 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fa158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

'infer_data failed for target AC093158.1' Retry (10/10). Failure caused by ValueError("Regression for target gene AC093158.1 failed. Cause ValueError('buffer source array is read-only').").
distributed.worker - WARNING - Compute Failed
Function: execute_task
args: ((<function check_meta at 0x7f22488fb158>, (<function infer_partial_network at 0x7f22387e2268>, 'GBM', (<class 'dict'>, [['learning_rate', 0.01], ['n_estimators', 5000], ['max_features', 0.1], ['subsample', 0.9]]), array([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 1., 1., ..., 0., 0., 0.],
...,
[1., 0., 1., ..., 1., 0., 0.],
[2., 0., 0., ..., 0., 0., 0.],
[0., 0., 1., ..., 0., 0., 0.]], dtype=float32), ['NOC2L', 'HES4', 'CPTP', 'MORN1', 'HES5', 'PRDM16', 'TP73', 'HES2', 'ZBTB48', 'VAMP3', 'ENO1', 'AGMAT', 'ZBTB17', 'HP1BP3', 'ZBTB40', 'LUZP1', 'ZNF436', 'ASAP3', 'E2F2', 'RUNX3', 'LIN28A', 'GMEB1', 'ZCCHC17', 'HDAC1', 'ZBTB8B', 'ZNF362', 'ZSCAN20', 'SFPQ', 'TFAP2E', 'MTF1', 'POU3F1', 'HEYL', 'MYCL', 'RLF', 'SMAP2', 'ZFP69B', 'ZFP69', 'EXO5', 'ZNF684', 'NFYC', 'SCMH1', 'FOXO6', 'HIVEP3', 'FOXJ3', 'YBX1', 'ZNF691', 'KDM4A', 'KLF17', 'DMAP1', 'AKR1A1', 'DMBX1', 'FOXE3', 'FOXD2', 'DMRTA2', 'MAGOH', 'GLIS1', 'SSBP3', 'PRKAA2',
kwargs: {}
Exception: ValueError('Metadata mismatch found in from_delayed.\n\nExpected partition of type DataFrame but got NoneType')

not shutting down client, client was created externally
finished
'infer_data failed for target DENND2D' Retry (1/10). Failure caused by ValueError("Regression for target gene DENND2D failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target MRPS14' Retry (2/10). Failure caused by ValueError("Regression for target gene MRPS14 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (9/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (3/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PI4KB' Retry (3/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TRMT1L' Retry (2/10). Failure caused by ValueError("Regression for target gene TRMT1L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ARHGEF11' Retry (1/10). Failure caused by ValueError("Regression for target gene ARHGEF11 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target BMP8A' Retry (10/10). Failure caused by ValueError("Regression for target gene BMP8A failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DUSP12' Retry (2/10). Failure caused by ValueError("Regression for target gene DUSP12 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (4/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target B4GALT3' Retry (3/10). Failure caused by ValueError("Regression for target gene B4GALT3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TMCO4' Retry (10/10). Failure caused by ValueError("Regression for target gene TMCO4 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL031429.1' Retry (10/10). Failure caused by ValueError("Regression for target gene AL031429.1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AK5' Retry (10/10). Failure caused by ValueError("Regression for target gene AK5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target UAP1' Retry (1/10). Failure caused by ValueError("Regression for target gene UAP1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target SLC44A5' Retry (10/10). Failure caused by ValueError("Regression for target gene SLC44A5 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ANKRD45' Retry (1/10). Failure caused by ValueError("Regression for target gene ANKRD45 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (4/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DENND2D' Retry (2/10). Failure caused by ValueError("Regression for target gene DENND2D failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target MRPS14' Retry (3/10). Failure caused by ValueError("Regression for target gene MRPS14 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PI4KB' Retry (4/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (5/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DUSP12' Retry (3/10). Failure caused by ValueError("Regression for target gene DUSP12 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TRMT1L' Retry (3/10). Failure caused by ValueError("Regression for target gene TRMT1L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target C1orf109' Retry (10/10). Failure caused by ValueError("Regression for target gene C1orf109 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ARHGEF11' Retry (2/10). Failure caused by ValueError("Regression for target gene ARHGEF11 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target B4GALT3' Retry (4/10). Failure caused by ValueError("Regression for target gene B4GALT3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target UAP1' Retry (2/10). Failure caused by ValueError("Regression for target gene UAP1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ANKRD45' Retry (2/10). Failure caused by ValueError("Regression for target gene ANKRD45 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (5/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DENND2D' Retry (3/10). Failure caused by ValueError("Regression for target gene DENND2D failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target MRPS14' Retry (4/10). Failure caused by ValueError("Regression for target gene MRPS14 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PI4KB' Retry (5/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (6/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target TRMT1L' Retry (4/10). Failure caused by ValueError("Regression for target gene TRMT1L failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ARHGEF11' Retry (3/10). Failure caused by ValueError("Regression for target gene ARHGEF11 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target B4GALT3' Retry (5/10). Failure caused by ValueError("Regression for target gene B4GALT3 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DUSP12' Retry (4/10). Failure caused by ValueError("Regression for target gene DUSP12 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target ANKRD45' Retry (3/10). Failure caused by ValueError("Regression for target gene ANKRD45 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target AL391832.2' Retry (6/10). Failure caused by ValueError("Regression for target gene AL391832.2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target UAP1' Retry (3/10). Failure caused by ValueError("Regression for target gene UAP1 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target DENND2D' Retry (4/10). Failure caused by ValueError("Regression for target gene DENND2D failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target CNTN2' Retry (7/10). Failure caused by ValueError("Regression for target gene CNTN2 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target MRPS14' Retry (5/10). Failure caused by ValueError("Regression for target gene MRPS14 failed. Cause ValueError('buffer source array is read-only').").
'infer_data failed for target PI4KB' Retry (6/10). Failure caused by ValueError("Regression for target gene PI4KB failed. Cause ValueError('buffer source array is read-only').").
Traceback (most recent call last):
File "/home/kbaral/anaconda3/bin/pyscenic", line 10, in
sys.exit(main())
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/pyscenic/cli/pyscenic.py", line 420, in main
args.func(args)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/pyscenic/cli/pyscenic.py", line 72, in find_adjacencies_command
network = method(expression_data=ex_mtx, tf_names=tf_names, verbose=True, client_or_address=client, seed=args.seed)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/arboreto/algo.py", line 41, in grnboost2
early_stop_window_length=early_stop_window_length, limit=limit, seed=seed, verbose=verbose)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/arboreto/algo.py", line 135, in diy
.compute(graph, sync=True)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/client.py", line 2758, in compute
result = self.gather(futures)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/client.py", line 1822, in gather
asynchronous=asynchronous,
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/client.py", line 753, in sync
return sync(self.loop, func, *args, **kwargs)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/utils.py", line 331, in sync
six.reraise(*error[0])
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/six.py", line 693, in reraise
raise value
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/utils.py", line 316, in f
result[0] = yield future
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/tornado/gen.py", line 735, in run
value = future.result()
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/tornado/gen.py", line 742, in run
yielded = self.gen.throw(*exc_info) # type: ignore
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/distributed/client.py", line 1653, in _gather
six.reraise(type(exception), exception, traceback)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/six.py", line 692, in reraise
raise value.with_traceback(tb)
File "/home/kbaral/anaconda3/lib/python3.7/site-packages/dask/dataframe/utils.py", line 521, in check_meta
errmsg))
ValueError: Metadata mismatch found in from_delayed.

Expected partition of type DataFrame but got NoneType
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-13, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-8, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-2, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-12, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-16, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-7, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-20, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-1, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-15, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-19, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-10, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-14, started daemon)>
distributed.process - WARNING - reaping stray process <ForkServerProcess(ForkServerProcess-18, started daemon)>
distributed.nanny - WARNING - Worker process 155392 was killed by signal 15
distributed.nanny - WARNING - Worker process 155366 was killed by signal 15
distributed.nanny - WARNING - Worker process 155390 was killed by signal 15
distributed.nanny - WARNING - Worker process 155380 was killed by unknown signal
distributed.nanny - WARNING - Worker process 155368 was killed by unknown signal
distributed.nanny - WARNING - Worker process 155378 was killed by signal 15
distributed.nanny - WARNING - Worker process 155402 was killed by signal 15
distributed.nanny - WARNING - Worker process 155394 was killed by signal 15
distributed.nanny - WARNING - Worker process 155384 was killed by signal 15
distributed.nanny - WARNING - Worker process 155400 was killed by signal 15
distributed.nanny - WARNING - Worker process 155388 was killed by unknown signal
distributed.nanny - WARNING - Worker process 155404 was killed by signal 15
distributed.nanny - WARNING - Worker process 155396 was killed by unknown signal

@JBreunig
Copy link

No, I haven't seen those errors. I did have to play with the num_workers argument but those warning were different.

I adjusted my code from this:
https://github.com/aertslab/pySCENIC/blob/master/notebooks/pySCENIC%20-%20Integration%20with%20scanpy.ipynb

But I've now used it on over a dozen datasets without issue (10X and Smart-seq v4).

I might recommend just running this tutorial to try to determine whether it is version issues, data format, or something else.

@KabitaBaral1
Copy link

Thank you very much @JBreunig . I tweaked it and got it to work, except the part where I have to put the outputs from GRN, AUCell and regulon together into a loom file. The problem is that I ran my auc in CLI so I don't have it as matrix, and when I try to run this excerpt of code:

add_scenic_metadata(adata, auc_mtx, regulons)

I keep running into AssertionError:
blank assertion error. Do you have any other idea on creating loom file from these three files to compile them?

Thank you . I appreciate your help.

@JBreunig
Copy link

The particular step "add_scenic_metadata(adata, auc_mtx, regulons)" is only adding the metadata to the anndata structure. The loom output is farther down (export2loom) but I don't think that's the issue here.

The more likely issue is that I think that there were some variable name and folder name ambiguities in that particular notebook that you have to adjust/correct or maybe your matrices aren't matching up. For example if I recollect correctly with auc_mtx = aucell(exp_mtx, regulons, num_workers=20), I don't think that exp_mtx is loaded and so you have to do that elsewhere.

For the second potential issue, did you check that the size of your auc_mtx and exp_mtx correspond in the appropriate dimension (i.e. cell number only as genes vs. regulons will be different between the two)?
auc_mtx.shape
exp_mtx.shape

I recently was unable to perform this step on a 300,000K cell dataset, leading to me reinstalling anaconda with no luck--still troubleshooting that. But otherwise I haven't had an error as long as my inputs (adata[and thus exp_mtx], and auc_mtx) matched up. If I make an upstream change in adata that doesn't correspond to the raw matrix I'd passed to pySCENIC, it will cause issues because of size mismatches.

@KabitaBaral1
Copy link

Hi,
Thank you for the help.
Yes, I somehow fixed it but I get another error while trying to write into a loom file using the command:
export2loom(df_tpm.T, regulons, LOOM_FNAME_1, cell_annotations=adata_1.obs['cell'].to_dict(), tree_structure=(), title='Schwann_Cells', compress=True)

I keep getting this error:
Object dtype dtype('O') has no native HDF5 equivalent

I saw this error in GitHUb but found no solution to it. Did you ever run into this error?
Do you have any idea on how to fix it?

Thank you. Much appreciated. And sorry I keep bugging you.

Thank you

@JBreunig
Copy link

No worries...happy to give what little assistance I can offer ;)

Unfortunately, I haven't had time to troubleshoot that function. I tried once it didn't include the features that I wanted in SCOPE because I didn't have them in the appropriate format to add to the LOOM. It may have been that same error and it caused me to drop some of the data. (i.e., if you remove the cell_annotations, or tree structure argument, does it complete successfully? If so, something in one of those is incompatible with the HDF5 format.) linnarsson-lab/loompy#12

I general, I am just saving things as an h5ad file.

@KabitaBaral1
Copy link

I see, Thank you.
I tried without cell_annotations, and it works but it does not let me open the loom file in R. My plan was to open loom file in R for data analysis.
Did you ever get this warning for regulons?:

Regulon name does not seem to be compatible with SCOPE. It should include a space to allow selection of the TF.
Please run:
regulons = [r.rename(r.name.replace('(+)',' ('+str(len(r))+'g)')) for r in regulons]
or:
regulons = [r.rename(r.name.replace('(',' (')) for r in regulons]

I believe that this might be the issue here.
Also, if you never create the loom file, how do you analyze your output? And, do you open h5ad file in R and perform further analysis?

I think that's my main concerns. (i) if my regulons are in correct format. I followed the vignette you sent link to for regulons. (ii) where and how do I perform data analysis and (iii) how to I compile them all together.

Thank you again.

@JBreunig
Copy link

I'm mainly looking at my data in Scanpy. An alternate way I was going to try to make the loom was to start at [65] in this notebook: https://github.com/aertslab/SCENICprotocol/blob/master/notebooks/PBMC10k_SCENIC-protocol-CLI.ipynb

I haven't had a chance yet...good luck!

@Pentayouth
Copy link

What worked for me was to rename the colnames of the GRNBoost output .tsv required by the downstream wrapper runSCENIC_1_coexNetwork2modules() and save the file in the respective scenic int/ directory.

e.g.:
GRNBoost_output <- read.delim("~dir_to_GRNboost_tsv_file/ex_5000SC_network.tsv", header=FALSE)
colnames(GRNBoost_output) <- c("TF","Target","weight")
saveRDS(GRNBoost_output, file="int/1.4_GENIE3_linkList.Rds")

I believe the "header" parameter for the function "read.delim" should be "TRUE".
Using FALSE will cause an error because the header would be mistakenly treated as the first row.

@JABioinf
Copy link

Hi,
I have been running pySCENIC using singularity and got the 3 main important files: auc_mtx.csv, adjacencies.tsv, regulons.csv.
Was there any updated method to import those outputs into SCENIC in R?
Is there a way to save them to be read directly with the ScenicOptions class in R?
source: https://pyscenic.readthedocs.io/en/latest/installation.html
Thank you!

@jpezoldt
Copy link

jpezoldt commented Apr 8, 2023 via email

@xinpengdun
Copy link

the easist way is to downlaod 1.4_GENIE3_linkList.Rds, and move it to your int fold:
you can download 1.4_GENIE3_linkList.Rds from
https://scenic.aertslab.org/scenic_paper/examples/SCENIC_MouseBrain/int/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

8 participants