Welcome!This home page contains the Supplementary Materials supporting the current series of publications titled:

"Expression variation: its relevance to disease and therapy"

Also see below the supporting data for the earlier article titled: "Successful anti-cancer targets able to pass FDA review demonstrate the identifiable signature distinct from the signatures of random genes and initially proposed targets"

Supplementary materials (R) to the publication "Expression variation: its relevance to disease and therapy"

P1.1 Initial large-scale project data on normal expression (Human Body Index):

GSE3526_series-1 GSE3526_series-2 GSE7307 Human body index-1 GSE7307 Human body index-2 GSE7307 Human body index-3

P1.2 Initial large-scale project data on cancer expression (EXPO project):

GSE2109-1 GSE2109-2 GSE2109-3 GSE2109-4 GSE2109-5 GSE2109-6 GSE2109-7 GSE2109-8 GSE2109-9

P1.3 Initial gene expression data - small scale projects:

GDS1249 Dendritic cells GDS1439 Prostate, benign and cancer GDS1553 Umbilical vein endothelial cells GDS1579 Leucocytes generic GDS1665 Thyroid norm and cancer GDS1673 Normal lung GDS1732 Thyroid, normal vs tumor GDS1973 Prostate normal GDS2118 CD34+ progenitors from a bone marrow GDS2216 Monocyte-derived dendritic cells GDS2221-monocyte derived dendritic cells GDS1439 Prostate, benign and cancer GDS2431-CD34+ Hematopoetic progenitor cells, differentiation GDS2609 colon mucos, normal and early cancer GDS2611 Epidermal keratinocytes GDS2615 Bronchial mucocilial cells GDS2628 Bronchial muscle cells GDS2635 Breast, normal and cancer, micodissected GDS2697 Normal sperm GDS2611 Proliferative endometrium normal GSE2125 Alveolar macrophages GSE2817 Glyoma GSE3045 Astrocytes GSE3141 Primary lung tumors GSE3325 Prostate, normal, cancer and metastatic GDS3678 Thyroid norm vs cancer GSE4452 Multiple myeloma untreated GSE4587 Normal skin vs melanoma GSE4757 Brain, normal vs Alzheimer’s GSE4888 Endometrium normal GSE4845 melanoma GSE5060_GPL570 Airway epithelium, smokers vs non_smokers GSE5079 Astrocytes GSE5372 Large airway epithelium, pre_ and post_ trauma GSE5504 Normal monocytes GSE5850 Normal oocytes GSE6090 Dendritic cells GSE6281 Skin GSE6798 Muscle normal vs disease GSE6338 Peripheral T_cell lymphoma GSE8672 Lymphocytes GSE9452 Colon mucosa normal vs inflamed GSE9647 huvec endothelial cells GSE9884 Bone marrow mesenchimal stem cells GSE6872 Normal sperm GSE7023 Normal renal and cancer GSE7476 Bladder norm and cancer GSE7896 Embryonic stem cells GSE7846 Endometrium normal and hyperplasia GSE8302 Normal hepatocytes GSE8514 Normal adrenal gland GSE8668 Neutrophils normal GSE8671 Normal colon mucosa and adenoma GSE7807 Normal monocytes

P2.1 Normal expression, individual tissue environments:

Accumbens Adipose tissue Adipose tissue omental Adrenal gland Amygdala Anja cells Aorta B-cell resting Bone marrow Breast Bronchus Caudate Colon Coronary artery Corpus calosum Dorsal root ganglia Endometrium Esophagus Fallopian tube GDS1439 Prostate GDS1673 Normal lung GDS1973 Prostate normal GDS2118-CD34+ Progenitors from a bone marrow GDS2609 Colon mucos, normal vs early cancer GDS2635 Normal vs cancer, microdissected Glia, normal vs cancer GSE3678 Thyroid norm vs cancer GSE4452 Multiple myeloma untreated GSE4845 melanoma, norm vs cancer GDS2615 Bronchial mucocilial cells GDS3045 Astrocytes GDS2737 Proliferative endometrium normal GSE4888 Endometrium normal GSE5060 Airway epithelium GSE5372 Large airway epithelium GSE5850 Normal oocytes GSE7807 Normal monocytes GSE7896 Embryonic stem cells GSE8514 Normal adrenal gland GSE8668 Neutrophils normal GSE8671 Normal colon mucosa, adenoma excluded GSE9894 Bobe marrow mesenchimal stem cells Heart Atrium Heart Ventricle Hep G2 cells Hippocampus Huvec cell lines Hypothalamus Joint tissue sinovium Kidney Liver Liver normal Lung Lymph nodes Medulla Midbrain Myometrium Nipple cross section Nodose nucleus Occipital lobe Oral mucosa Oral mucosa-1 Ovary Parietal lobe Penis normal Pharyngeal mucosa Pituitary Prifrontal cortex Prostate Putamen Salivary gland Skeletal muscle Skin Small intestine Spinal cord Spleen Stomach normal Substantia nigra Synovial membrane Temporal lobe Testes Thalamus Thyroid gland Tongue normal Tonsil Trachea Trigeminal ganglia Vagina Ventral tegmental area Vestibular nuclei superior Vulva normal

P2.2 Cancer expression, individual tissue environments:

Abdominal wall mass and peritoneum Appendix area Bladder Bone and cartilage Brain Breast Breast-2 Breast-3 Endometrium Esophagus Fallopian GSE2817 Glyoma GSE4452 Multiple myeloma untreated GSE4845 Melanoma Ileum and small intestine Jejunum Kidney-2 Kidney-3 Liver cancer Lung-2 Lung-3 Lung cancer Lymphatic node Myometrium cancer Omentum Ventral tegmental area Ovary-2 Omentum Muscle Pancreas Parotid gland Pelvic mass Penis Prostate Rectosigmoid Skin Rectum Renal pelvis Retroperitoneal mass and peritoneum , Spleen Stomach Stomach Testis Thyroid Tongue Ureter Uterus Uterus-2 Vulva cancer

P2.3 Paired datasets, comparing cancer and norm for the same tissue environments:

Adipose tissue, norm vs cancer Adrenal gland, norm vs cancer Bladder, norm vs cancer Breast, norm vs cancer Colon, norm vs cancer Endometrium, normal vs cancer Esophagus, normal vs cancer Fallopian tube, normal vs cancer GDS2609 Colon mucos, normal vs early cancer GDS2635 Normal vs cancer, microdissected Glia, normal vs cancer GSE3678 Thyroid norm vs cancer GSE4452 Multiple myeloma untreated GSE4845 melanoma, norm vs cancer Ileum and small intestine, norm vs cancer Kidney, norm vs cancer Liver, normal vs cancer Lung, normal vs cancer Lymphatic node, norm vs cancer, lymphoma Normal skin vs melanoma Myometrium, normal vs cancer Ovary, normal vs.cancer Pancreas, norm vs cancer Parotid gland, norm vs cancer Prostate, norm vs cancer Small intestine, norm vs cancer Stomach, normal vs cancer Thyroid gland, norm vs cancer Tongue, normal and cancer Urethra, normal vs cancer Vagina, normal vs cancer Vulva, normal vs cancer

P2.4 Integrated panel of differential expression:

Integrated panel of differential expression

P3.1 Large-scale panel of normal variability data:

P3 Read-me P3.1.1 Variabilities and expression values, combined panel P3.1.2 Expression values P3.1.3 Variabilities, Z scores P3.1.4 Variabilities, Q-Q plot P3.1.5 Variabilities, high Z scores only P3.1.6 Panel avearged variability

P3.2 The data supporting figures:

Figure 1 Figure 2 Figure 3 Figure 4 and Figure 5 Patent database searching

P3.3 Bootstrapping subsets:

Subset 1 Subset 2 Subset 3 Subset 4 Subset 5 Subset 6 Subset 7 Subset 8 Analysis

P4. Ontological analysis of the highest and lowest variability classes:

P4.1 Random-1 P4.2 Random-2 P4.3 Random-3 P4.4 Random-4 P4.5 Random-5 P4.6 Random-6 P4.7 Random-7 P4.8 Random-8 P4.9 Random-9 P4.10 Random-10 P4.11 Random-11 P4.12 Random-12 P4.13 Highest variability genes P4.14 Lowest variability genes P4.15 Aging P4.16 Alzheimer's P4.17 Atherosclerosis P4.18 Cancer P4.19 Depression P4.20 Diabetes P4.21 FDA-approved anti-cancer targets P4.22 FDA-approved non-cancer targets P4.23 Obesity P4.24 Multiple participants P4.25 Analysis

P5. Therapeutic exploitation of expression variability: optimization of cancer radioisotope therapy:

RIT1 Original normal and cancer expression and target selection

RIT2 Target candidates validated by multiple probe-sets

RIT3 Expression values for each target were averaged based on multiple probe-sets

RIT4 Selection of the best RIT targets

RIT5 Annotation of target candidastes

RIT6 Glioma targets

P6. Therapeutic exploitation of expression variability: defining criteria of a successful antibiotic target:

Prokaryotic data, GDS1827, Affymetrix platform Phylogenetic conservation analysis

Supporting data for the manuscript: "Successful anti-cancer targets able to pass FDA review dmonstrate the identifiable signature distinct from the signatures of random genes and initially proposed targets" by A. Mayburd, I. Golovchikova and J. Mulshine

Contents and brief description of this site

Rationale, Methodology and Discussion

Example Postfolio

Appendix A

Appendix B

Appendix C

The contents of the Excel files below correspond to RD (Raw data) abbreviation in the main manuscript.

List of genes used in this project (for scoring system 1): Template

Programs used in this study: Eliminator Patcher Profiler Statdata 1 Summator Datalookup 2 CorrByRow CorrByRow-ini Correlator Correlator-ini

Code to the programs: Utils.pas Datalookup.dpr uMain.dfm uMain.pas Eliminator.dpr uMain.dfm uMain.pas Highlighter.dpr uMain.dfm uMain.pas Patcher.dpr uMain.dfm uMain.pas StatData uMain.dfm uMain.pas Terminator.dpr uMain.dfm uMain.pas

Initial Data S1: S1 The gene aliases of the targets in the porject S1 Dataset A primary Affymetrix data extracted from GEO S1 Dataset A EST tag libraries composite file S1 Dataset A SAGE tag libraries composite file S1 Dataset A assembly of GEO microarry + SAGE data Dataset C

Secondary Clustering data for norm S1: GDS962 GDS946 GDS914 GDS855 GDS724 GDS707 GDS596 GDS564 GDS558 GDS539 GDS534

Secondary Clustering data for cancer S1: GDS901 GDS89 GDS884 GDS854 GDS84 GDS807 GDS760 GDS75 GDS715 GDS709 GDS690 GDS536 GDS531 GDS507 GDS536 GDS531

Classes of targets, asignment of training and testing sets S2: S2 Subdivision of successful class into training and testing sets

Successful developing and proposed targets S2: S2 Successful, developing and proposed targets

DEXCON score introduced S3: S3 DEXCON score introduced

INTVAR score introduced S4: S4 Interprobe variance assembly S4 Interprobe variance assembly- CANCER subpanel S4 Intvar scores for the total panel

SURV score introduced S5: S5 Tertiary clustering cancer panel S5 Tertiary clustering normalcy panel S5 Tertiary clustering normalcy panel - table of contents S5 Survival coefficient, its derivation and analysis

Correlative profiles and derivation of FU-test S6: S6 Breast cancer dataset correlative profiles S6 Glioma dataset correlative profiles S6 Myeloma dataset correlative profiles S6D Features assembled for individual probe-sets

Scoring system 1 and 2 aggregated and applied to Expanded dataset, S7: S7A Integration in a linear classifier S7B Relative functional enrichment

Bonus-pnalty coefficient as a function of score, aggregated system 1 and 2, S8: S8 Bonus-penalty coefficient in integrated classifier

Cross-validation data S9: S9 Cross-validaton, scoring system 1 S9 Cross-validation scoring system 1 and 2 combined

Exclusion of bias S10: S10 Exclusion of bias

ROC curve analysis S11: S11 ROC curve, system 1 S11 ROC curve, aggregated system (1+2)

Validation by literature search S12: S12 Validation by Pubmed analysis

Proof that overfit is absent S13: S13 Overfit control

Study of Alzheimer's disease S14: S14 Targets for Alzheimer's disease treatment

How many anti-cancer targets? S15: S15 Suggested anti-cancer targets 1 S15 Suggested anti-cancer targets 2