It seems to work ok if I have a zero prior, so the problem with trigamma.m comes from introducing the non-zero prior. I have no idea how to fix this, so any help you can give would be much appreciated.
==================================================
I recalled I also had such problem for some seeds.
Instead of training model with seeds= (1, 10), I trained models with seeds = (1,30) and then collected 10 successful ones.
6/13/2008
6/12/2008
Q&A in E_Coli code
1. why you transponse muD twice?
==============================
muD = net.exp.D; % muD's first column is biased column, row = 'to', col = 'from'
tmp = muD(:,2:end)'; % tmp = transpose of muD, such that in tmp row = 'from', col = 'to'
for j = 1:length(from_idx)
tmp(from_idx(j),to_idx(j)) = double(conn_attr(j))*tmp(from_idx(j),to_idx(j))*mu; % keep it row = 'from', col = 'to'
end
muD(:,2:end) = tmp'; % go back to muD's original format, i.e. row = 'to', col = 'from'
==============================
2. And are the row, column indices in from_idx, to_idx counted with the bias column included?
i.e in the E_coli example from_idx = 27. Does this mean gene 26 (with + 1 from bias column)?
from_idx, to_idx , conn_attr , all derived from the shift network prior:
% hns pp glpC 1
% hns pp glpQ 1
% hns pp cyoD 0
% hns pp cyoE 0
% hns pp cyoB 0
% hns pp cyoA 0
% hns pp sdhB -1
% hns pp arcA 0
% hns pp appY -1
% hns pp cadA -1
% hns pp cadB -1
% hns pp hdeB -1
hns is the 27th gene in common_genereg_names.mat, which is already in the E_Coli.zip, subdirectory 'step1-make_yn_inpn\matdata'.
common_genereg_names.mat is extracted from 'top50®.sif' and vsn-normalization.xls. Please refer this post (after Download) for detail.
.
Values in conn_attr.mat are 1 (positive), -1(negative) or 0 (none).
muD = net.exp.D; % muD's first column is biased column, row = 'to', col = 'from'
tmp = muD(:,2:end)'; % tmp = transpose of muD, such that in tmp row = 'from', col = 'to'
for j = 1:length(from_idx)
tmp(from_idx(j),to_idx(j)) = double(conn_attr(j))*tmp(from
end
muD(:,2:end) = tmp'; % go back to muD's original format, i.e. row = 'to', col = 'from'
==============================
2. And are the row, column indices in from_idx, to_idx counted with the bias column included?
i.e in the E_coli example from_idx = 27. Does this mean gene 26 (with + 1 from bias column)?
from_idx, to_idx , conn_attr , all derived from the shift network prior:
% hns pp glpC 1
% hns pp glpQ 1
% hns pp cyoD 0
% hns pp cyoE 0
% hns pp cyoB 0
% hns pp cyoA 0
% hns pp sdhB -1
% hns pp arcA 0
% hns pp appY -1
% hns pp cadA -1
% hns pp cadB -1
% hns pp hdeB -1
hns is the 27th gene in common_genereg_names.mat, which is already in the E_Coli.zip, subdirectory 'step1-make_yn_inpn\matdata'.
common_genereg_names.mat is extracted from 'top50®.sif' and vsn-normalization.xls. Please refer this post (after Download) for detail.
.
Values in conn_attr.mat are 1 (positive), -1(negative) or 0 (none).
12/20/2007
AUC vesus Reps reps= [1 2 4 8 16]

Download Figures: auc_reps_all_in_1.pdf and each separate figures in directory.
Note: Bars are grouped to show AT LEAST how many reps for x-arc is needed to reach some range of AUC value.
For example:
In the first subfigure (mu=1, kk =1), to reach range [0.625, 0.65], 0-arc at least reps =16 is needed, others only need reps =1.
Download AUC_data: arc0_mat; arc1_mat;arc2_mat;arc3_mat
In 1-arc test, auc value is stored by arc1_mat(mu_ind,kk_ind,reps_ind) = auc;
e.g. if mu = 2, kk =3, reps =8, the corresponding index should be arc1(2,3,4).
Setting:
murange = [1 2 4];
kkrange = [1:3];
reps = [1 2 4 8 16];
12/13/2007
AUC vesus Reps
Download:
http://www.cse.buffalo.edu/~juanli/auc_vs_reps/
p.s. mu_p5.pdf contains the legend, others don't have a legend.
Setting:
reps = [1 4 8];
murange = [0.1 0.5 1 2 4 8 12 16];
kkrange = [1:16];
seeds=[1:10];
http://www.cse.buffalo.edu/~juanli/auc_vs_reps/
p.s. mu_p5.pdf contains the legend, others don't have a legend.
Setting:
reps = [1 4 8];
murange = [0.1 0.5 1 2 4 8 12 16];
kkrange = [1:16];
seeds=[1:10];
AUC vesus Reps
Download:
http://www.cse.buffalo.edu/~juanli/auc_vs_reps/
p.s. mu_p5.pdf contains the legend, others don't have a legend.
Setting:
reps = [1 4 8];
murange = [0.1 0.5 1 2 4 8 12 16];
kkrange = [1:16];
seeds=[1:10];
http://www.cse.buffalo.edu/~juanli/auc_vs_reps/
p.s. mu_p5.pdf contains the legend, others don't have a legend.
Setting:
reps = [1 4 8];
murange = [0.1 0.5 1 2 4 8 12 16];
kkrange = [1:16];
seeds=[1:10];
10/10/2007
ROC / AUC scripts + biossm tarball
1. ROC / AUC scripts
Download: http://www.cse.buffalo.edu/~juanli/roc_auc.zip
2. biossm tarball
Download: http://www.cse.buffalo.edu/~juanli/biossm.tar
Download: http://www.cse.buffalo.edu/~juanli/roc_auc.zip
2. biossm tarball
Download: http://www.cse.buffalo.edu/~juanli/biossm.tar
8/30/2007
vsn & E_Coli calculation scripts
Download:
vsn script
E_Coli script
vbssm_v3.3.7
Both scripts are OK on my desktop, but I have no time to modify corresponding paths for your running, sorry.
readme.txt will be helpful, take a look.
-----------------------------------------------------------------------------------
actually what we should do is to run vsn_normalization data with the same priors - please could post your scripts for this ecoli calculation and the vsn_normalization calculation also. I 'll get my new student to look at them while you are away. Maybe we can talk when you return if we have any questions before you start work?
8/21/2007
E_Coli expr
Data is sightly changed on gene 'hns'!
The profile of gene 'hns' is modified to have constant of 0, -20, -100, respectively, after normalization.
---------------------------------------------------------------------------
This gene should have zero expression, which may mean that it should be constant and very low (negative) after normalization, rather than zero. -David
1. F vs kk
Hyper-opt is on_______ Hyper-opt is off
The profile of gene 'hns' is modified to have constant of 0, -20, -100, respectively, after normalization.
---------------------------------------------------------------------------
This gene should have zero expression, which may mean that it should be constant and very low (negative) after normalization, rather than zero. -David
1. F vs kk

2. Add Posterior(vsn-normalization) as Prior(E_Coli.xls --'no-inter' sheet)
Download: FreqTable(PDF): hns = 0, -20, -100
3-in-1 xls file: freq0-20-100.xls
Download: FreqTable(PDF): hns = 0, -20, -100
3-in-1 xls file: freq0-20-100.xls
8/16/2007
norm script
Download:
1. norm_genes.m
2. example_genes.xls
norm_genes contains 3 parts:
1. read xls file
2. normlize data
3. generate input data for vbssm model.
Note:
1. 'example_genes.xls' is generated by extracting first 10 genes from 'E.coli.values.xls'--'no-inter' sheet.
2. There are 2 replicates timeseries , each of 8 timepoints
3. Data is already log-transformed
1. norm_genes.m
2. example_genes.xls
norm_genes contains 3 parts:
1. read xls file
2. normlize data
3. generate input data for vbssm model.
Note:
1. 'example_genes.xls' is generated by extracting first 10 genes from 'E.coli.values.xls'--'no-inter' sheet.
2. There are 2 replicates timeseries , each of 8 timepoints
3. Data is already log-transformed
hns profiles after normalization
8/03/2007
inter
Notes:
1. Shift Subset is defined from Page 16 of Manchester.pdf
2. In muD (prior mean matrix of D), only 12 entries'signs are adjusted, based on the info in memo1. Only these 12 entries are multiplied with mu value (e.g. *0.5), when mu varies. Other entries inherit their sign and value from previous experiment.
3. 10 posterior MEAN matrices for A,B,C, D from the previous experiment (vsn_normalization), with some entries adjusted for the new priors. However, vbssm is unable to run when posterior COVARIANCE is incorporated, since 'trigamma' function will report severe problem to stop computation.
----------------------------------------------------------------------------------
Memo1:Priors
hns-> glpC, glpQ +ve (these appeared in the model network and were confirmed by the experiment)
hns-> cyo D,E,B,A no connection (not confirmed by experiment)
hns -> sdhB -ve (confirmed)
hns-> arcA no connection
hns-> appY -ve (connected confirmed but sign different)
hns-> cad A,B -ve (opposite sign)
hns -> hdeB -ve (opposite sign)
-------------------------------------------------------------------------------
Memo2. Posterior
Expt A, instead of starting with 10 random seeds, you need to start from the 10 posterior matrices for A,B,C, D from the previous experiment (vsn_normalization), with the means and variances adjusted for the new priors, i.e. the posteriors from the previous experiment become the priors for the new experiment.
-------------------------------------------------------------------------------
-----------------------------------------------------------------------------------
Note:
Q: How to present 'no connection' in prior matrix?
A: This is be a prior constrained around zero - i.e mean zero but with very tight distribution (low variance)
8/01/2007
Set Prior Covariance Matrix for A,B,C,D
vbnet examples: mu = 0 (no prior), mu = 0.1 (with prior)
Download: ARD derivation notes
Data: Zak's data
reps = 4;
kk = 6;
arc = 9; (arc 9 is added as prior arc)
---------------------------------------------------------------------------------
Hi Juan
I guess so, how did you specify priors for the ARD prior experiments?
You need to set the mean and variance I guess, which would just be the diagonal of the full covariance matrix.
-- In ARD expr,
where delta = 1*ones(pinp,1) % by default initialization in Matt's code.
So, I only need to set the mean, let variance be default -Juan
I didn't realize the code output the full covaraiances - maybe we can look at the posterior covariances to understand correlation between the parameters as I suggested earlier - can you put some samples from the ARD prior experiments (Zak's data) on the web site?
- See top
Let's try to talk at 8am Friday if that works for you. If not then Thursday 8am would also work for me
- OK, Friday 8am.
David
----------------------------------------------------------------------------------
I happen to realize that, vbssm specifies the Prior Covariance matrix of D as diagonal, not full matrix.
(see the attached derivation Page 2, Equation 7)
But, I have checked that the Posterior Covariance matrix of D obtained from the previous experiment (vsn_normalization) is actually a full matrix. How should we treat it? Is it OK to let the non-diagonal entries be zero, in order to fit the vbssm model?
-Juan
Download: ARD derivation notes
Data: Zak's data
reps = 4;
kk = 6;
arc = 9; (arc 9 is added as prior arc)
---------------------------------------------------------------------------------
Hi Juan
I guess so, how did you specify priors for the ARD prior experiments?
You need to set the mean and variance I guess, which would just be the diagonal of the full covariance matrix.
-- In ARD expr,
So, I only need to set the mean, let variance be default -Juan
I didn't realize the code output the full covaraiances - maybe we can look at the posterior covariances to understand correlation between the parameters as I suggested earlier - can you put some samples from the ARD prior experiments (Zak's data) on the web site?
- See top
Let's try to talk at 8am Friday if that works for you. If not then Thursday 8am would also work for me
- OK, Friday 8am.
David
----------------------------------------------------------------------------------
I happen to realize that, vbssm specifies the Prior Covariance matrix of D as diagonal, not full matrix.
(see the attached derivation Page 2, Equation 7)
But, I have checked that the Posterior Covariance matrix of D obtained from the previous experiment (vsn_normalization) is actually a full matrix. How should we treat it? Is it OK to let the non-diagonal entries be zero, in order to fit the vbssm model?
-Juan
F vs kk for "E_Coli_ no_inter_sheet"
7/19/2007
Accumulative vs Non-Accumulative for Zak's Data
Download:
Accumulative figures
Non-Accumulative figures
zak_accu_nonaccu.zip
Data: 'Zak's'.
reps = [1 4 8]; stdrange = [1.66 2.33 3]
kkrange = [1:16]; // According to Matt, we didn't explore optimal kk value for Zak's data
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
seedrange = [1:10];
Results are the average of 10 models (seeds).
Accumulative figures
Non-Accumulative figures
zak_accu_nonaccu.zip
Data: 'Zak's'.
reps = [1 4 8]; stdrange = [1.66 2.33 3]
kkrange = [1:16]; // According to Matt, we didn't explore optimal kk value for Zak's data
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
seedrange = [1:10];
Results are the average of 10 models (seeds).
7/10/2007
Prior Test for "Shift Subset"
Download: PriorTest_Shift_subset.pdf
Notes:
1. Shift Subst is defined from Page 16 of Manchester.pdf
2. Priors are defined from Page 15 of Manchester.pdf
4 verified interactions are incorporated as Priors for vbssm model:
hns pd appY
hns pd cadA
hns pd cadB
hns pd hdeB
hns pd cadA
hns pd cadB
hns pd hdeB
The other 2 verified interactions do NOT belong to Shift Subset
arcA pd hybB
gutM pd srlR
gutM pd srlR
6/21/2007
Aucroc T = 6 figure


reps = [2 4 8 16], T = [6 12 120]

reps = 4, T = [6 12 120]
6/04/2007
Zak data, scripts and our simulated data sets

Note: Folder "Matt_profiles" contains the script Matt wrote to generate the plot of the noisy versions of the mRNAs.
i.e. 'MA','MB','MC','MD','ME','MF','MG','MH','MJ','MK'
6/02/2007
Accumulative vs Non-Accumulative Priors


Download: (Average of 10 seeds)
1. Accumulative results: Z = 1.65, Z = 2.33, Z= 3
2. Non-Accumulative results: Z = 1.65, Z = 2.33, Z = 3
3.Sorted "top50 & reg" as blocks ( PDF / XLS), gray indicated no entry in vsn-normalization.xls
1. Accumulative results: Z = 1.65, Z = 2.33, Z= 3
2. Non-Accumulative results: Z = 1.65, Z = 2.33, Z = 3
3.Sorted "top50 & reg" as blocks ( PDF / XLS), gray indicated no entry in vsn-normalization.xls
Setting:
k = 6 (optimum);
seedrange = [1:10];
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
Subscribe to:
Posts (Atom)