Notes:
1. Shift Subset is defined from Page 16 of Manchester.pdf
2. In muD (prior mean matrix of D), only 12 entries'signs are adjusted, based on the info in memo1. Only these 12 entries are multiplied with mu value (e.g. *0.5), when mu varies. Other entries inherit their sign and value from previous experiment.
3. 10 posterior MEAN matrices for A,B,C, D from the previous experiment (vsn_normalization), with some entries adjusted for the new priors. However, vbssm is unable to run when posterior COVARIANCE is incorporated, since 'trigamma' function will report severe problem to stop computation.
----------------------------------------------------------------------------------
Memo1:Priors
hns-> glpC, glpQ +ve (these appeared in the model network and were confirmed by the experiment)
hns-> cyo D,E,B,A no connection (not confirmed by experiment)
hns -> sdhB -ve (confirmed)
hns-> arcA no connection
hns-> appY -ve (connected confirmed but sign different)
hns-> cad A,B -ve (opposite sign)
hns -> hdeB -ve (opposite sign)
-------------------------------------------------------------------------------
Memo2. Posterior
Expt A, instead of starting with 10 random seeds, you need to start from the 10 posterior matrices for A,B,C, D from the previous experiment (vsn_normalization), with the means and variances adjusted for the new priors, i.e. the posteriors from the previous experiment become the priors for the new experiment.
-------------------------------------------------------------------------------
-----------------------------------------------------------------------------------
Note:
Q: How to present 'no connection' in prior matrix?
A: This is be a prior constrained around zero - i.e mean zero but with very tight distribution (low variance)
8/03/2007
inter
8/01/2007
Set Prior Covariance Matrix for A,B,C,D
vbnet examples: mu = 0 (no prior), mu = 0.1 (with prior)
Download: ARD derivation notes
Data: Zak's data
reps = 4;
kk = 6;
arc = 9; (arc 9 is added as prior arc)
---------------------------------------------------------------------------------
Hi Juan
I guess so, how did you specify priors for the ARD prior experiments?
You need to set the mean and variance I guess, which would just be the diagonal of the full covariance matrix.
-- In ARD expr,
where delta = 1*ones(pinp,1) % by default initialization in Matt's code.
So, I only need to set the mean, let variance be default -Juan
I didn't realize the code output the full covaraiances - maybe we can look at the posterior covariances to understand correlation between the parameters as I suggested earlier - can you put some samples from the ARD prior experiments (Zak's data) on the web site?
- See top
Let's try to talk at 8am Friday if that works for you. If not then Thursday 8am would also work for me
- OK, Friday 8am.
David
----------------------------------------------------------------------------------
I happen to realize that, vbssm specifies the Prior Covariance matrix of D as diagonal, not full matrix.
(see the attached derivation Page 2, Equation 7)
But, I have checked that the Posterior Covariance matrix of D obtained from the previous experiment (vsn_normalization) is actually a full matrix. How should we treat it? Is it OK to let the non-diagonal entries be zero, in order to fit the vbssm model?
-Juan
Download: ARD derivation notes
Data: Zak's data
reps = 4;
kk = 6;
arc = 9; (arc 9 is added as prior arc)
---------------------------------------------------------------------------------
Hi Juan
I guess so, how did you specify priors for the ARD prior experiments?
You need to set the mean and variance I guess, which would just be the diagonal of the full covariance matrix.
-- In ARD expr,
So, I only need to set the mean, let variance be default -Juan
I didn't realize the code output the full covaraiances - maybe we can look at the posterior covariances to understand correlation between the parameters as I suggested earlier - can you put some samples from the ARD prior experiments (Zak's data) on the web site?
- See top
Let's try to talk at 8am Friday if that works for you. If not then Thursday 8am would also work for me
- OK, Friday 8am.
David
----------------------------------------------------------------------------------
I happen to realize that, vbssm specifies the Prior Covariance matrix of D as diagonal, not full matrix.
(see the attached derivation Page 2, Equation 7)
But, I have checked that the Posterior Covariance matrix of D obtained from the previous experiment (vsn_normalization) is actually a full matrix. How should we treat it? Is it OK to let the non-diagonal entries be zero, in order to fit the vbssm model?
-Juan
F vs kk for "E_Coli_ no_inter_sheet"
7/19/2007
Accumulative vs Non-Accumulative for Zak's Data
Download:
Accumulative figures
Non-Accumulative figures
zak_accu_nonaccu.zip
Data: 'Zak's'.
reps = [1 4 8]; stdrange = [1.66 2.33 3]
kkrange = [1:16]; // According to Matt, we didn't explore optimal kk value for Zak's data
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
seedrange = [1:10];
Results are the average of 10 models (seeds).
Accumulative figures
Non-Accumulative figures
zak_accu_nonaccu.zip
Data: 'Zak's'.
reps = [1 4 8]; stdrange = [1.66 2.33 3]
kkrange = [1:16]; // According to Matt, we didn't explore optimal kk value for Zak's data
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
seedrange = [1:10];
Results are the average of 10 models (seeds).
7/10/2007
Prior Test for "Shift Subset"
Download: PriorTest_Shift_subset.pdf
Notes:
1. Shift Subst is defined from Page 16 of Manchester.pdf
2. Priors are defined from Page 15 of Manchester.pdf
4 verified interactions are incorporated as Priors for vbssm model:
hns pd appY
hns pd cadA
hns pd cadB
hns pd hdeB
hns pd cadA
hns pd cadB
hns pd hdeB
The other 2 verified interactions do NOT belong to Shift Subset
arcA pd hybB
gutM pd srlR
gutM pd srlR
6/21/2007
Aucroc T = 6 figure


reps = [2 4 8 16], T = [6 12 120]

reps = 4, T = [6 12 120]
6/04/2007
Zak data, scripts and our simulated data sets

Note: Folder "Matt_profiles" contains the script Matt wrote to generate the plot of the noisy versions of the mRNAs.
i.e. 'MA','MB','MC','MD','ME','MF','MG','MH','MJ','MK'
6/02/2007
Accumulative vs Non-Accumulative Priors


Download: (Average of 10 seeds)
1. Accumulative results: Z = 1.65, Z = 2.33, Z= 3
2. Non-Accumulative results: Z = 1.65, Z = 2.33, Z = 3
3.Sorted "top50 & reg" as blocks ( PDF / XLS), gray indicated no entry in vsn-normalization.xls
1. Accumulative results: Z = 1.65, Z = 2.33, Z= 3
2. Non-Accumulative results: Z = 1.65, Z = 2.33, Z = 3
3.Sorted "top50 & reg" as blocks ( PDF / XLS), gray indicated no entry in vsn-normalization.xls
Setting:
k = 6 (optimum);
seedrange = [1:10];
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
5/20/2007
Subset recovery for Shift -- Freqency Show

Download: pdf / .sif (with freqency on right)
Data: Shift network of vsn_normalization.xls
Setting: 10 seeds, k = 6 (optimum), mu = 0 (no prior incorporated)
Download: Script
Readme:
1. Run script to get the file containing hit log
Script to Run: "runS_subset_prior_freq.m"
Input:
S_yn.mat
S_inpn.mat
S_inpn.mat
prior_inter.mat
Dependencies:
find_arcs.m
Output:
hitlog/hit_subset
2. Analyze the hit log file
Script to Run : "analyze_hitlog_freq.m"
Input:
hitlog/hit_subset
Output:
freq_Prior_subset_Shift_vsn_top50reg.txt
Format:
from pd to mu= 0 .1 .5 1 2 4 8 12 16 32 64 100
cadB pd cadA 5 9 4 4 5 7 4 2 5 7 4 6
The number behind "cadB pd cadA" indicated this edge occurances among 10 different random seeds models, for different mu.
5/10/2007
'top50 & reg' 'Block-Prior' Freqency Show
Data: vsn-normalization.xls 'Control' Data
Setting:
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100]; Download all figures while mu varies
10 seeds for vbssm model training
X-label: true arc index
Y-label: frequency of the arc recovered in 10 seeds training
Notes:
1) These figures indicate mu=0.5 are optimal
2) tdcA-arcs are more easily recovered then tdcR-arcs
*The frequency analysis is *AGREE with the previous average analysis.
5/03/2007
'top50 & reg' 'Block-Prior' Average Show
Data: vsn-normalization.xls 'Control' Data
Setting:
murange = [0 .1 .5 1 2 4 8 12 16 32 64 100];
10 seeds for vbssm model training
Procedures:
1. Sort 'top50 & reg' as blocks: pdf / xls
2. In k-th experiment, incoporate blocks[1:k] as prior, take the average of 10 seeds for significance computation
3. Analyze the recovered true arcs, Total # = 10.
They are grouped into 2 catogaries: tdcA-arcs & tdcR-arcs, where:
tdcA-arcs: (color is agree with legend)
tdcA pd tdcB
tdcA pd tdcC
tdcA pd tdcD
tdcA pd tdcE
tdcA pp tdcA
More details about this work
1. Sort the 'top50 & reg' as blocks.
That means, arcs with same 'from-gene' are grouped as one block, ignore the 'to-gene'.
E.g. The group containing all tdcA-arcs are named block-1, the group containing all tdcR-arcs are named block-2.
block-idx from to
>From block-3, block-index is numbered by the alphabet order of 'from-genes'
2. Add prior for vbssm training.
There are 29 experiment, since the block # I sort in 'top50®' is exactly 29. Each experiment also explores mu range.
1st test, add all tdcA-arcs (i.e. block-1) as prior
2nd test, keep the tdcA-arcs (block-1) as prior, meanwhile add tdcR-arcs (block-2) as prior.
....
29th test, all arcs (i.e. block(1:29) ) are added as priors.
3. Organize the 29 experiment data, each murange = [0 .1 .5 1 2 4 8 12 16 32 64 100]
Since mu value play an important role in recovery evaluation, I made the figure to reflect this point.
There is no true network at hand, only 10 known arcs. I named them as 2 groups
tdcA-arcs:(from gene = tdcA, to gene = don't care)
tdcA pd tdcB
tdcA pd tdcC
tdcA pd tdcD
tdcA pd tdcE
tdcA pp tdcA
tdcR-arcs:(from gene = tdcR, to gene = don't care)
tdcR pp tdcA
tdcR pp tdcB
tdcR pp tdcC
tdcR pp tdcE
tdcR pp tdcD
Different colors, blue and pink, to demonstrate 10 true arcs recovery results. Blue = tdcA-arcs, pink = tdcR-arcs.
The bule+pink stack gives the total number of vbssm identified true arcs.
Based on my understanding, figure revealed at least 2 information:
1. mu = 0.5 is optimum mu value, at which # of recovered arcs reaches peak.
2. In global view, pink-arcs only showed with some special mu, whileas, blue-arcs are not significantly affected by mu value.
tdcA pd tdcD
tdcA pd tdcE
tdcA pp tdcA
tdcR-arcs: (color is agree with legend)
tdcR pp tdcA
tdcR pp tdcB
tdcR pp tdcC
tdcR pp tdcE
tdcR pp tdcD
tdcR pp tdcC
tdcR pp tdcE
tdcR pp tdcD
More details about this work
1. Sort the 'top50 & reg' as blocks.
That means, arcs with same 'from-gene' are grouped as one block, ignore the 'to-gene'.
E.g. The group containing all tdcA-arcs are named block-1, the group containing all tdcR-arcs are named block-2.
block-idx from to
1 | tdcA | tdcA |
1 | tdcA | tdcB |
1 | tdcA | tdcD |
1 | tdcA | tdcE |
1 | tdcA | tdcF |
1 | tdcA | tdcG |
1 | tdcA | tdcC |
2 | tdcR | tdcA |
2 | tdcR | tdcB |
2 | tdcR | tdcC |
2 | tdcR | tdcE |
2 | tdcR | tdcF |
2 | tdcR | tdcG |
2 | tdcR | tdcD |
>From block-3, block-index is numbered by the alphabet order of 'from-genes'
2. Add prior for vbssm training.
There are 29 experiment, since the block # I sort in 'top50®' is exactly 29. Each experiment also explores mu range.
1st test, add all tdcA-arcs (i.e. block-1) as prior
2nd test, keep the tdcA-arcs (block-1) as prior, meanwhile add tdcR-arcs (block-2) as prior.
....
29th test, all arcs (i.e. block(1:29) ) are added as priors.
3. Organize the 29 experiment data, each murange = [0 .1 .5 1 2 4 8 12 16 32 64 100]
Since mu value play an important role in recovery evaluation, I made the figure to reflect this point.
There is no true network at hand, only 10 known arcs. I named them as 2 groups
tdcA-arcs:(from gene = tdcA, to gene = don't care)
tdcA pd tdcB
tdcA pd tdcC
tdcA pd tdcD
tdcA pd tdcE
tdcA pp tdcA
tdcR-arcs:(from gene = tdcR, to gene = don't care)
tdcR pp tdcA
tdcR pp tdcB
tdcR pp tdcC
tdcR pp tdcE
tdcR pp tdcD
Different colors, blue and pink, to demonstrate 10 true arcs recovery results. Blue = tdcA-arcs, pink = tdcR-arcs.
The bule+pink stack gives the total number of vbssm identified true arcs.
Based on my understanding, figure revealed at least 2 information:
1. mu = 0.5 is optimum mu value, at which # of recovered arcs reaches peak.
2. In global view, pink-arcs only showed with some special mu, whileas, blue-arcs are not significantly affected by mu value.
4/04/2007
ARD Net
figure parameter: 3-arc-idx = 9 , reps=8 , kk = 15
seedrange = [1:10]; murange = [.1 .5 1 2 4 8 12 16];
vbssm output for 3-arc-idx = 9, kkrange = [1:16] is at
'/home/csgrad/juanli/work/log/triplet_128'
Net Name Explanation:
'net_2_arc9_kk15_seed1.mat' means
3-arc-idx = 9, kk = 15, seed = 1, mu = 2;
*p1 = 0.1; p5 = 0.5
--------------------------------------------------------------------------------------------
Juan
I need the vbssm output to be able to look at possible correlations in the CBD matrix. Please can you generate this in the first instance for the model with arc =3, reps=8 and kk = 15 say, the same values that were in the figure 3arc_info_auc that you sent me. Ideally, I would like all models k = 1:16 - these could be in separate .mat files to reduce storage space.
David
---------------------------------------------------------------------------------------------
seedrange = [1:10]; murange = [.1 .5 1 2 4 8 12 16];
vbssm output for 3-arc-idx = 9, kkrange = [1:16] is at
'/home/csgrad/juanli/work/log/triplet_128'
Net Name Explanation:
'net_2_arc9_kk15_seed1.mat' means
3-arc-idx = 9, kk = 15, seed = 1, mu = 2;
*p1 = 0.1; p5 = 0.5
--------------------------------------------------------------------------------------------
Juan
I need the vbssm output to be able to look at possible correlations in the CBD matrix. Please can you generate this in the first instance for the model with arc =3, reps=8 and kk = 15 say, the same values that were in the figure 3arc_info_auc that you sent me. Ideally, I would like all models k = 1:16 - these could be in separate .mat files to reduce storage space.
David
---------------------------------------------------------------------------------------------
3/30/2007
Explore mu range on "Contro " of vsn-normalizaion
3/26/2007
ARD Data
1-arc: reps = 1 / 4 / 8
2-arc: reps = 1 / 4 / 8
3-arc: reps = 1 / 4 / 8
Each includes matrice trained from10 seeds.
The data structure is organized as:
Sample | reps | arc_ind | kk | mu | auc-val
pair_map.mat gives the pair_index, and triplet_map gives the triplet_index, mapped from 12 single arcs.
see also: ARD Experiment, ARD scipts, ARD results
2-arc: reps = 1 / 4 / 8
3-arc: reps = 1 / 4 / 8
Each includes matrice trained from10 seeds.
The data structure is organized as:
Sample | reps | arc_ind | kk | mu | auc-val
pair_map.mat gives the pair_index, and triplet_map gives the triplet_index, mapped from 12 single arcs.
see also: ARD Experiment, ARD scipts, ARD results
3/20/2007
ARD Scripts
Collected in http://www.cse.buffalo.edu/~juanli/ard_scripts/
It includes the scripts for ARD experiments.
part 1
run_single_121.m (1-arc prior, sample = 12, reps = 1)
run_single_124.m (1-arc prior, sample = 12, reps = 4)
run_single_128.m (1-arc prior, sample = 12, reps = 8)
run_pair_121 .m (2-arc prior, sample = 12, reps = 1)
run_pair_124 .m (2-arc prior, sample = 12, reps = 4)
run_pair_128 .m (2-arc prior, sample = 12, reps = 8)
run_triplet_121.m (3-arc prior, sample = 12, reps = 1)
run_triplet_124.m (3-arc prior, sample = 12, reps = 4)
run_triplet_128.m (3-arc prior, sample = 12, reps = 8)
part2
some auxilary scripts to calculate auc/roc quantity.
In addition, pair_map.mat gives the pair_index, and triplet_map gives the triplet_index, mapped from 12 single arcs.
part3
readme explains what the various fields in the network structure are--------------------------------------------------------
Please can you put the matlab files which contain the model parameters for your ARD experiments on the web site also, together with a readme file which explains what the various fields in the network structure are?
It includes the scripts for ARD experiments.
part 1
run_single_121.m (1-arc prior, sample = 12, reps = 1)
run_single_124.m (1-arc prior, sample = 12, reps = 4)
run_single_128.m (1-arc prior, sample = 12, reps = 8)
run_pair_121 .m (2-arc prior, sample = 12, reps = 1)
run_pair_124 .m (2-arc prior, sample = 12, reps = 4)
run_pair_128 .m (2-arc prior, sample = 12, reps = 8)
run_triplet_121.m (3-arc prior, sample = 12, reps = 1)
run_triplet_124.m (3-arc prior, sample = 12, reps = 4)
run_triplet_128.m (3-arc prior, sample = 12, reps = 8)
part2
some auxilary scripts to calculate auc/roc quantity.
In addition, pair_map.mat gives the pair_index, and triplet_map gives the triplet_index, mapped from 12 single arcs.
part3
readme explains what the various fields in the network structure are--------------------------------------------------------
Please can you put the matlab files which contain the model parameters for your ARD experiments on the web site also, together with a readme file which explains what the various fields in the network structure are?
3/19/2007
VBSSM Release Issue
vbssm_v3.4.1, including the local/remote running script example is at
http://www.cse.buffalo.edu/~juanli/vbssm341.tgz
1. From David
Please could you check that the latest vbssm software, with prior incorporation is included in the release at
http://www.cse.buffalo.edu/faculty/mbeal/vbssm.html
Chis Miller should be able to help you.
What release version is this? -David
-- This release version is the original version 3.0 (08/11/03) Matt posted. (Juan)
2. From David
Have you worked on any of these yet?
In upcoming releases v3.4+
Provide sample scripts to demonstrate features new in release v3.4. -Yes
Allow missing (unobserved) entire time points in the data (smooth, predict, and feedback). -No
Allow for missing individual dimensions at some time points (sensor failure). -No. (Juan)
3. From David
We need the scripts for cluster usage and some example scripts to be included in the tar file
Please can you make sure this is done before 3/30?
-Yes. Download vbssmv3.4.1
http://www.cse.buffalo.edu/~juanli/vbssm341.tgz
1. From David
Please could you check that the latest vbssm software, with prior incorporation is included in the release at
http://www.cse.buffalo.edu/faculty/mbeal/vbssm.html
Chis Miller should be able to help you.
What release version is this? -David
-- This release version is the original version 3.0 (08/11/03) Matt posted. (Juan)
2. From David
Have you worked on any of these yet?
In upcoming releases v3.4+
Provide sample scripts to demonstrate features new in release v3.4. -Yes
Allow missing (unobserved) entire time points in the data (smooth, predict, and feedback). -No
Allow for missing individual dimensions at some time points (sensor failure). -No. (Juan)
3. From David
We need the scripts for cluster usage and some example scripts to be included in the tar file
Please can you make sure this is done before 3/30?
-Yes. Download vbssmv3.4.1
3/13/2007
Example AUC curves to show info arc/k
replicates = 1: 1-arc example / 2-arc example / 3-arc example
replicates = 4: 1-arc example / 2-arc example
replicates = 8: 1-arc example / 2-arc example / 3-arc example
Info Table: pdf / xls
--------------------------------------------------------
Q: shouldn't the y-axis be labelled 'delta auc' ?
A: It is the original "auc" curve to show ROC-C better than ROC-B, Whereas the delta-auc curve is aimed to compare the increase quantity among 1-arc, 2-arc and 3-arc priors.
replicates = 4: 1-arc example / 2-arc example
replicates = 8: 1-arc example / 2-arc example / 3-arc example
Info Table: pdf / xls
--------------------------------------------------------
Q: shouldn't the y-axis be labelled 'delta auc' ?
A: It is the original "auc" curve to show ROC-C better than ROC-B, Whereas the delta-auc curve is aimed to compare the increase quantity among 1-arc, 2-arc and 3-arc priors.
3/08/2007
F_vs_K Plot on 'vsn-normalization.xls' -- Adding prior
Recovered Network (PDF) Threshold = 1.6
Data: vsn-normalization.xls. ONLY Control network
Steps:
1. Adding 1 known connection as prior each time. Total is 10 tests, coming from 10 connections.
2. Retrain vbssm model and find optimal K value (10 retrainings all show k = 6 is the optimum)
3. Compare the recovered network with the known connections
Notes:
1." tdcF" and "tdcG" have *No* Entries in vsn-normalization.xls. Therefore, the following 4 intersections,
tdcA pp tdcF
tdcA pp tdcG
tdcR pp tdcF
tdcR pp tdcG
are excluded in the prior-test. There are 10 known connetions as prior candidates.tdcA pp tdcG
tdcR pp tdcF
tdcR pp tdcG
2. In CBDioZ, threshold = 1.6 / 3, consistent with that in the previous no-prior test.
------- Original Email --------------
Juan
I looked at your results. I see a few of the known connections in
top50®.sif in any of the reconstructed Control network. For instance,
tdcA pd tdcB
tdcA pd tdcC
tdcA pd tdcD
tdcA pd tdcE
are ok but the following are missing
tdcA pp tdcA
tdcA pp tdcF
tdcA pp tdcG
tdcR pp tdcA
tdcR pp tdcB
tdcR pp tdcC
tdcR pp tdcE
tdcR pp tdcF
tdcR pp tdcG
tdcR pp tdcD
You need to cross-check the .sif files carefully to see what is correct.
What I suggest is that we concentrate on the Control network for now, and start to add the known connections in top50®.sif as priors. You might want to do these 1 regulator at a time, and retrain the vbssm models. I guess you will need to repeat the F vs K plots first. What we ought to see is that known connections persist in the models.
By the way ihfAihfB pp tdcA should be
ihfA pp tdcA
ihfB pp tdcA
etc.
Please let me know if you have any questions about this.
Thanks
David
3/05/2007
F_vs_K Plot on Vichy's Data
Vichy Data (70 genes) PDF
-----------------------------------------------------------------
Linda
I looked at the plot and it looks fine.
The message below the plot suggests a matalb version problem. Can you
check if this is the problem?
You should now be able to proceed with producing a model for k =8 .
Maybe Juan has the matlab matrix file already and she can post it on the
website
David
-----------------------------------------------------------------------------------
Hi, David and Linda
This is the F vs K plot (10 seeds). k=8 is the optimum.
I had no trouble generating matrics (yn.mat and inpn.mat). I looked into the xls file and found from line 32 there are some notations following numerical data in the same row. I'm guessing the problem is probably coming from the function 'xlsread'. Is your MATLAB too old? I'm using MATLAB 7.0.1.
-Juan
--------------------------------------------------------------------------
Juan
Would you be able to help Linda debug this? I won't have any time to look
at it before next week. First key steps would be to produce F vs K plots
for this data. I don't un derstand what Linda means by "it would only
accept 31 out of the 70 genes".
Many thanks
David
---------- Forwarded message ----------
Date: Fri, 2 Mar 2007 17:05:25 -0000
From: "Hughes, Linda"
To: "'D.L.Wild@warwick.ac.uk' (E-mail)"
Cc: "B-Wollaston, Vicky"
Subject: modelling
Hi david
i ran vickys data through the modelling scripts. The bad news is that it
would only accept 31 of the 70 genes i wanted to put through to generate the
matrices. im not sure if this is a problem with the excel file or matlab as
i re-created the excel file a number of times but it still wont work.
i wonder if you could run the script with this excel file on your laptop so
i know where the problem is coming from.
<> <>
i decided to carry on with the 31 that would work and have run the FSvskk
script without any problem, heres a pdf of the output with a single seed (i
will put everything on the weblog when i come back) just ignore the control
data on the left. it seems that matrix 6 is the optimum.
<>
i was also wondering if you would send me a screen dump of the commands you
used to run CBDioZ after you loaded the relevant matrix for the example data
as being a newbie to matlab, im still struggling with syntax issues; im
particularly interested in the Imagesc command to view the interactions.
lastly im now connected to the buffalo servers and have been trying to run
theFSvskk script using the cluster, i was wondering if you new whether
matlab is installed on the server or whether i need to run it another way?
else i can just ask the buffalo people
sorry bout the long email, i just needed to ask everything before i go on
holiday and give you an idea of the current state of play
thanks
linda
Linda
I looked at the plot and it looks fine.
The message below the plot suggests a matalb version problem. Can you
check if this is the problem?
You should now be able to proceed with producing a model for k =8 .
Maybe Juan has the matlab matrix file already and she can post it on the
website
David
-----------------------------------------------------------------------------------
Hi, David and Linda
This is the F vs K plot (10 seeds). k=8 is the optimum.
I had no trouble generating matrics (yn.mat and inpn.mat). I looked into the xls file and found from line 32 there are some notations following numerical data in the same row. I'm guessing the problem is probably coming from the function 'xlsread'. Is your MATLAB too old? I'm using MATLAB 7.0.1.
-Juan
--------------------------------------------------------------------------
Juan
Would you be able to help Linda debug this? I won't have any time to look
at it before next week. First key steps would be to produce F vs K plots
for this data. I don't un derstand what Linda means by "it would only
accept 31 out of the 70 genes".
Many thanks
David
---------- Forwarded message ----------
Date: Fri, 2 Mar 2007 17:05:25 -0000
From: "Hughes, Linda"
To: "'D.L.Wild@warwick.ac.uk' (E-mail)"
Cc: "B-Wollaston, Vicky"
Subject: modelling
Hi david
i ran vickys data through the modelling scripts. The bad news is that it
would only accept 31 of the 70 genes i wanted to put through to generate the
matrices. im not sure if this is a problem with the excel file or matlab as
i re-created the excel file a number of times but it still wont work.
i wonder if you could run the script with this excel file on your laptop so
i know where the problem is coming from.
<
i decided to carry on with the 31 that would work and have run the FSvskk
script without any problem, heres a pdf of the output with a single seed (i
will put everything on the weblog when i come back) just ignore the control
data on the left. it seems that matrix 6 is the optimum.
<
i was also wondering if you would send me a screen dump of the commands you
used to run CBDioZ after you loaded the relevant matrix for the example data
as being a newbie to matlab, im still struggling with syntax issues; im
particularly interested in the Imagesc command to view the interactions.
lastly im now connected to the buffalo servers and have been trying to run
theFSvskk script using the cluster, i was wondering if you new whether
matlab is installed on the server or whether i need to run it another way?
else i can just ask the buffalo people
sorry bout the long email, i just needed to ask everything before i go on
holiday and give you an idea of the current state of play
thanks
linda
Subscribe to:
Posts (Atom)