Summary
	Online application for survival analysis (OASIS 2) is a one-stop tool for various statistical tasks involved in analyzing survival data in a user-friendly manner. OASIS 2 provides a uniform platform that is an essential application to facilitate efficient statistical analyses of survival data in the ageing field.

	The statistical features of OASIS 2 include the calculation of Kaplan-Meier estimates, mean/median lifespan, mortality rate, Mantel-Cox Log-Rank test, Fisher's exact test, weighted Log-Rank test, Kolmogorov-Smirnov test and Neyman's smooth test. Moreover, OASIS 2 generates survival and mortality curves that can be easily exported and modified by using common graphic softwares.

Download
	We provides OASIS 2 source codes for statistical analyses of survival data. All codes are provided as open source under a GNU General Public License (GPL).
	Basic Survival Analysis
		Python (higher than 2.4) and matplotlib should be properly installed. [ Download ]
	Statistical testing methods
		Python (higher than 2.4), numpy, rpy, statlib, chow test, R, and surv2sample (R package) should be properly installed. [ Download ]
	Cox proportional hazards regression
		Python (higher than 2.4), rpy, R, and coxrobust (R package) should be properly installed. [ Download ]

Basic survival analysis
	Input Format

		As shown in above figure, OASIS 2 takes following format of input data for basic analysis and statistical testing between samples. The data should be tab-delimited. You can download example files : example1, example2

		% Condition Identifier (Required) [Total number of subjects] (Optional) # Comment (Optional) Survival Data (Required, The data should be Tab-Delimited) First column should be time (Required) Second column should be the number of deaths (Required) Other columns are the number of censored data withe different reasons (At least one column is required)
	Output Format
		In basic survival analysis, OASIS 2 provides several survival statistics such as Kaplan-Meier estimator, Mean/Median lifespan, Survival curve and Mortality curves which can help to user interpret their survival data.
	Statistics for the estimation of observed survival In ageing research, a description of survival data such as the estimation of mean lifespan is essential for determining the effects of a drug treatment or genetic manipulation on ageing. Thus, one of the primary objectives of survival analysis is the estimation of survival function from incomplete datasets. To estimate survival time as the area under the survival curve, it is necessary to characterize the survival function which is a probability of death after some specific time t. 1. Kaplan-Meier estimator The Kaplan-Meier estimator estimates the survival or the hazard function from observation of lifetimes. It is kind of explanatory method for the time to event, where the time is considered as the most prominent variable. In lifespan studies, it might be used to measure the fraction of organisms living for a certain amount of time after event. To estimate survival time as the area under the survival curve, it is necessary to characterize the survival function which is a probability of death after some specific time t. where t is some specific time, T is a random variable of the time, and P denotes the probability of death. Generally, we only observe censored data which include missing subjects. The Kaplan-Meier or product-limit estimator was proposed for right censored analysis and it is the most common method of estimating the survival function S(t). where is the conditional probability of surviving during the j^th interval, d_j is the number of deaths and n_i is the size of the population at risk during the j^th interval. The variance of Kaplan-Meier estimator at time point ti can be estimated by Greenwood's formula. Although the calculated variance tends to underestimate the true variance of Kaplan-Meier estimator for small to moderate-sized samples, it is relatively close to true variance and therefore commonly used in various studies. where is the Kaplan-Meier estimator, dj is the number of deaths and ni is the size of the population at risk during the jth interval. Standard error of Kaplan-Meier estimator was given by 2. Mean lifespan Mean survival time is estimated as the area under the survival curve. The estimator is based upon the entire range of data. The Irwin's restricted mean lifespan, which was described by Kaplan and Meier as well, was estimated as the area under the survival curve by the following formula. where is the largest observed time and is estimated with Kaplan-Meier method. 3. Age in days at % mortality This shows the closest observed day that certain percentage mortality passed. Because this gives the closest observed day, the age in days in % mortality gives a descriptive result. You cannot rely on this if you have many censored data or your experimental time points are not sufficient. Basically, it gives you the least day (from the observation) that your data passing certain % mortality. In order to overcome this limitation, we provide predicted days at % mortality by using linear interpolation as well. 4. Mortality rate The ratio of total deaths to total population over specified period time. For example, a mortality rate of 10.5 in a population of 1,000,000 would mean 1050 deaths per unit time 5. Survival plot A graph that shows the relationship between the survival and age of a population of a particular species, plotting the number or population of organisms surviving at each age for a given group. The Kaplan-Meier or product-limit estimator was proposed for right censored analysis and it is the most common method of estimating the survival function S(t). where is the conditional probability of surviving during the j^th interval, d_j is the number of deaths and n_i is the size of the population at risk during the j^th interval. 6. Log cumulative hazard plot The cumulative hazard function (H), which is the risk of death at time t, was estimated by following formula. where is the Kaplan-Meier estimator. Mortality curve shows the change in mortality rates over specified period time, plotting the number of deaths at each age for a given group. Please note that the analysis of age-specific mortality rate by using this graph can cause biased results when the sample size is not sufficiently large (Promislow et al., 1999). Users should be cautious about interpreting the results if the survival data were obtained from small samples sizes.

Statistical testing between samples
	Input Format
		Input Format for statistical testing is same as that of basic survival analysis
	Ouput Format

	Test for the significance of difference in lifespan In a lifespan study, the comparisons of survival functions between experiment and control groups are important to determine the efficacy of the experimental treatments such as genetic manipulation, dietary intervention, or drug treatments. To systematically compare survival functions between experiment and control, we need to check various statistics in survival datasets because different conditions may increase or decrease lifespan in different ways. For example, some conditions could only increase the average lifespan, whereas others could increase both of average and maximum lifespan. Therefore, the statistics of overall lifespan is compared using log-rank test, whilst those of a specific time point is compared with Fisher's exact test. Based on comparisons of various statistics with overall lifespan, we can infer which condition reduces mortality caused by mid-life diseases or slow down fundamental processes of ageing. 1. Log-rank test Mantel-Cox test, so-called log-rank test, is a kind of nonparametric test which is frequently used for the comparison of two survival functions through overall lifespan data. The log-rank statistics in two groups such as experiment and control is calculated as follows. where d_i is the number of deaths in group 1, and e_i (estimated as (d_in_li)/n_i ) is the number of expected deaths in group 1. n_1i is the size of the population of group 1 at risk during the i^th interval, and ni is the total size of population at risk during the i^th interval. P-value 0.00E+00 is provided when P < 1.0 * 10^-10. 2. Weighted log-rank test One might want to put more emphasis on earlier deaths than the later ones or vice versa. To generalize log-rank test for these needs, Fleming and Harrington developed G(rho, gamma)-weighted log-rank test. The weighted test statistics is calculated by following equation. where w_i is vary according to the type of tests. Wilcoxon-Breslow-Gehan Test Wilcoxon test statistic is constructed by weighting the contribution of each failure time to the overall test statistic by the number of subjects at risk. Thus it gives heavier weights to earlier failure times when the number at risk is higher. As a result, this test is susceptible to differences in the censoring pattern of the groups. Tarone-Ware Test Test is suggested by Tarone and Ware (1977), with weights equal to the square root of the number of subjects in the risk pool at time ti. Like Wilcoxon’s test, this test is appropriate when hazard functions are thought to vary in ways other than proportionally and when censoring patterns are similar across groups. The test statistic is constructed by weighting the contribution of each failure time to the overall test statistic by the square root of the number of subjects at risk. Thus, like the Wilcoxon test, it gives heavier weights, although not as large, to earlier failure times. Although less susceptible to the failure and censoring pattern in the data than Wilcoxon’s test, this could remain a problem if large differences in these patterns exist between groups. Peto-Peto-Prentice Test The test uses as the weight function an estimate of the overall survivor function, which is similar to that obtained using the Kaplan–Meier estimator. This test is appropriate when hazard functions are thought to vary in ways other than proportionally, but unlike the Wilcoxon–Breslow–Gehan test, it is not affected by differences in censoring patterns across groups. Fleming-Harrington Test G(rho, gamma) weight defined as S(t)^rho(1-S(t))^gamma. Generally if rho > 0 and gamma = 0, the test is sensitive against early difference, whereas if rho = 0 and gamma > 0, the test is sensitive against later differences. P-value 0.00E+00 is provided when P < 1.0 * 10^-10. Maximal Lifespan comparison OASIS 2 has a new feature for proper quantification of differences in maximal lifespan between datasets. Maximal lifespan is an upper percentiles of the distribution of lifespan, which contrasts with mean lifespan. Maximal lifespan could be determined by "fundamental process of aging" whereas mean lifespan changes with various condition such as diseases. This is of interest because increasing maximal lifespan may be an indicator that an intervention is slowing the general processes of aging and not merely retarding the development of specific diseases. Thus, it could be useful to detect differences in maximal lifespan as opposed to simply “curve squaring” that can be induced by increasing mean or median lifespan without increasing maximal lifespan. 3. Boschloo's test Regarding comparison of maximal lifespan, Boschloo’s Test is used to compare the fractions of the longevity outliers. The null hypothesis of Boschloo’s test (H_0,A) is that the fraction of outliers, which live longer than specific-time points, is similar between population A and B. The equation of H_0,A is: where x is an observation from population. L(x) is the survival time of x. τ denotes some threshold chosen by the investigator that could represent the criteria for a specific time point. OASIS 2 provides 25, 50, 75, 90% percentile-threshold as τ. 4. Mann-Whitney U test The modified Mann-Whitney U test were able to determine the differences in the distribution tails of survival data affecting maximal lifespan as well as the differences in the proportion of longevity outliers. The null hypothesis of the test (H_0,AB) is compound of H_0,A and the another null hypothesis (H_0,B). H0,B is that the outlier have a similar average of survival time between two different populations. The equation of the compound null hypothesis (H_0,AB) is: where I(i) is indicator function taking on value of one if i is true and zero otherwise. Mann-Whitney U Test is applied to compare the average (μ) of Z variables between two populations. 5. Fisher's exact test Fisher's exact test is frequently used in survival analysis. To test different survival functions at specific time point instead of overall lifespan, the program can calculate the probability of observed data with Fisher's exact test at different time points as following formula. where a and b are the number of living subjects in group 1 and group 2 respectively and c and d are that of dead subjects in group 1 and group 2 respectively at specific time t. P-value of Fisher's exact test was calculated with the sum of probabilities less than or equal to p_t of all combinations. Generally, 90% mortality is used for Fisher's exact test. However, in some cases, comparisons between two datasets showed no statistically significant difference because of several reasons, including drastic death at old age. This analysis suggests that one might want to put more emphasis on earlier deaths than the later ones because later deaths might result from causes unrelated to normal ageing. 6. Kolmogorov-Smirnov test While the log-rank test is commonly used for comparing survival data between samples, it is optimized for special assumptions on the underlying distributions such that the hazard ratio or relative risk / is constant in time t. In that case, a log-rank test generally gives optimal results. However, for considering general situation, a generalized test that does not depend on a special underlying distribution is needed. The Kolmogorov-Smirnov test is an appropriate solution for this purpose so that it robustly works in the condition where the hazard functions and cross over through time t. The Kolmogorov-Smirnov test is based on the following equation. where sup represents a supremum of a set which gives smallest real number that is greater than or equal to every number in the set and D represents the largest absolute vertical deviation. OASIS 2 adopted surv2.ks function implemented in R packages (R Development Core Team and contributors worldwide, 2008) to provide Kolmogorov-Smirnov test. We note that the Kolmogorov-Smirnov test in OASIS 2 is not applicable to survival data that contain tied observations [eg. multiple events (death or failure) during an observed time interval]. OASIS 2 provides a warning message if there is any tied observation in survival data within or between samples. 7. Neyman's smooth test Another test capable of detecting a wide spectrum of alternatives is Neyman's smooth test. It is developed to test the homogeneity of two different survival data by comparing a null model, S₁(t) = S₂(t), and various alternative models. The alternative models embedded the null model with Legendre polynomials based on Neyman's goodness-of-fit idea as following equation. where is a parameter set of bounded functions which are modelling possible difference between S₁(t) and S₂(t). Therefore, if , then null hypothesis is accepted. Since the Neyman's smooth test selects optimal smooth model in Legendre polynomials with Schwarz's selection rule, it is different from Kolmogorov-Smirnov test in the respect of providing an idea of the types of difference between two survival data. The selected dimension represents type of difference between S₁(t) and S₂(t). If the selected dimension (d) is 1, it suggests that S₁(t) and S₂(t) are different from each other by the constant hazard ratio. If the selected d is 2, then the relationship between two samples is likely to be monotonic. If the selected d is 3, then relationship between two samples is likely to have convex or concave form. OASIS 2 adopted surv2.neyman function implemented in R packages to provide the Neyman's smooth test. Similar to the Kolmogorov-Smirnov test, the Neyman's smooth test is not currently applicable when there are tied observations in survival data. 8. Chow test Chow test, a variant of F-test, was invented by economist Gregory Chow to test whether the coefficients in two linear regressions on different data sets are same or not. This test is generally used for detecting structural break that is an unexpected shift in time series data. In the OASIS 2, we used this analysis for detecting structural differences between two different log cumulative hazard functions by using the following equation. where RSS_p represents the sum of squared residuals from the pooled log cumulative hazard data. RSS₁ and RSS₂ represent the sum of squared residuals from two different log cumulative hazard data respectively. N₁ and N₂ are the number of observation in each data and k is 3 which is the total number of parameters of linear regression model. The test statistic follows the F distribution with k and N₁ + N₂ - 2k degrees of freedom. OASIS 2 adopted chow.py function implemented by Dr. Ernesto P. Adorio in http://adorio-research.org/wordpress/?p=1789. 9. Survival time F-test We provide survival time F-test, which is used to examine whether two normal populations have same variance or not. Because censored data are generally used in survival analysis, one can estimate the number of dead animals using survival function S(t) and then perform F-test for the comparison of variance between two different survival data. The F-test is used under the condition that the survival times of individual follow normal distribution. As a normality check method for the given dataset, we provide the Shapiro-Wilk test in the OASIS 2 website. If a P-value from the Shapiro-Wilk test is smaller than 0.01, then the chance of survival data following the normal distribution is less than 0.01, and therefore the results of F-test are not applicable. Thus, we provide warning message in this case. 10. Partial slopes Ranksum test We devised another statistical test method for comparing the difference of slope of two log cumulative hazard plots. We calculated partial slopes of the log cumulative hazard plot. With null hypothesis that two different log cumulative hazard plots have same slopes, we conducted rank-sum tests with set of partial slopes as following definitions. The partial slopes rank-sum test is based on non-parametric statistics that requires sufficient number of samples (in this case, partial slopes) for the reliable analysis. Since a partial slope is defined as the changes in log cumulative hazard divided by the corresponding change in survival time between two neighbouring time points, the number of observed time points rather than the total sample size is important for the non-parametric analysis. To obtain statistically meaningful results, at least six observed time points are needed. ,where D₁ and D₂ are set of partial slopes of each group. These sets are compared with ranksum test. 11. Normalized Chow test Chow test is used for testing whether the coefficients in two linear regressions on different datasets are same or not. However, researchers who perform survival analysis tend to be interested in examining the difference in slope rather than in determining the difference in y-intersect. For this purpose, before conducting Chow test, we normalized the log cumulative hazard data to have a mean of zero. In this case, the linear regression of each dataset has zero y-intersect. Thus, one can test the difference of the slope of each dataset and the pooled data. We verified the difference of lifespan variations through normalized Chow test, a statistical test that examines whether the coefficients of two linear regressions on different normalized data sets are equal. Like log-rank test, the assumption is that survival rate is constant over time to apply the normalized Chow test.

Cox proportional hazards regression
	Input Format

		OASIS 2 takes following format of input data for Cox proportional hazards regression.
		% Name of fields (Required) Survival Data (Required, The data should be Tab-Delimited) First column should be time (Required) Second column should be the status of events (Required) Other columns are considered as the values of risk factors (At least one column is required)
	Ouput Format

	Evaluating the effect of several risk factors OASIS 2 provides Cox proportional hazards regression which can evaluate the effect of several risk factors such as sex, age, and weight on survival on survival. By considering that hazard function such as mortality rate can be explained by the proportional sum of risk factors, Cox formulated semi-parametric model with following equation. where represent k risk factors which are assumed to act independently, are their regression coefficients, h₀(t) is the baseline hazard at time t, and i is a subscript for observation. To find risk factors that can explain hazard function with proportion, the input data format should be different from that of survival analysis. Therefore, we made a separate input form for Cox proportional hazards regression. 1. Partial likelihood estimator OASIS 2 provides two kinds of regression methods using partial likelihood estimator (PLE) and robust estimator through coxr function in R package. The PLE is generally used to estimate the regression coefficients by maximizing the partial likelihood where is a k x 1 vector of regression coefficients, X_i is a k x 1 vector of risk factors, and ti is a observed time of event. To estimate , which maximizes the partial likelihood, Cox's estimator solves the following score function that is the derivative with respect to of the log partial likelihood function. where t_i is a observed time of event and equals 1 for the deaths observed and 0 otherwise. 2. Robust estimator While the PLE is commonly used for Cox proportional hazards regression, it is known to be sensitive to outliers and the deflection of underlying assumption. Especially the PLE method is strongly influenced by large values of t exp(^TX). To reduce the influence of large values of t exp(^TX), Minder and Bednarski introduced smooth modification to the score function of the partial likelihood method, where smooth function A(t_i, X_i) is defined as A(t_i, X_i) = M-min( M, t_i exp(^TX_i)) and M is a 95th percentile of the samples {t₁ exp(^TX₁), ..., t_n exp(^TX_n)}. The smooth functions in the outer and inside sum have an reducing effect on large values of t exp(^TX) and ^TX respectively. OASIS 2 adopted coxr function implemented in R packages (R Development Core Team and contributors worldwide, 2009) to provide the robust estimation. In the coxr function, a better solution for smooth function A(t_i, X_i) is used by using (t)exp(^TX) instead of heuristic t exp(^TX), where (t) is -logS(t).

References

Description	Reference	Url
Kaplan-Meier Estimator	-	http://en.wikipedia.org/wiki/Kaplan-Meier_estimator
Survival Function	-	http://en.wikipedia.org/wiki/Survival_function
Examples	-	http://www.weibull.com/LifeDataWeb/nonparametric_analysis.htm
Examples	-	http://www.statsdirect.com/help/survival_analysis/kaplan.htm
Wilcoxon signed-rank test	-	http://faculty.vassar.edu/lowry/wilcoxon.html
Wilcoxon rank sum test	-	http://elegans.swmed.edu/~leon/stats/utest.html
Maximal lifespan	-	https://en.wikipedia.org/wiki/Maximum_life_span
Maximal lifespan	Wang et al., 2004	http://www.sciencedirect.com/science/article/pii/S0047637404001496
Boschloo's test	-	https://cran.r-project.org/web/packages/Exact/Exact.pdf
Boschloo's test	-	http://oai.cwi.nl/oai/asset/8128/8128A.pdf
Modified Mann-Whitney U Test	Gao et al., 2008	https://bmcmedresmethodol.biomedcentral.com/articles/10.1186/1471-2288-8-49
Fisher's exact test	-	http://www.socr.ucla.edu/htmls/ana/FishersExactTest_Analysis.html
Fisher's exact test	-	http://faculty.vassar.edu/lowry/fisher.html
Fisher's exact test	-	http://www.langsrud.com/fisher.htm
Log Rank Test	-	http://bioinf.wehi.edu.au/software/russell/logrank/index.html
Kaplan-Meier survival function	-	http://www.hutchon.net/Kaplan-Meier.htm
SURVSOFT	Geiss K et al.,2009	http://www.krebsregister-bayern.de/software_e.html
surv2sample	Kraus D, 2007	http://www.davidkraus.net/surv2sample/
coxrobust	Bednarski T et al., 2006	http://cran.r-project.org/web/packages/coxrobust/index.html
Chow test	Dr. Ernesto P. Adorio	http://adorio-research.org/wordpress/?p=1789
Weighted log-rank test	-	http://www.stata.com/manuals13/stststest.pdf

Promislow DEL, Tatar M, Pletcher SD, Carey JR (1999) Below-threshold mortality: implications for studies in evolution, ecology and demography. J. Evol. Biol. 12, 314-328.

Structural Bioinformatics Lab.

Summary

Download

Basic Survival Analysis

Statistical testing methods

Cox proportional hazards regression

Basic survival analysis

Input Format

Output Format

Statistics for the estimation of observed survival

Statistical testing between samples

Input Format

Ouput Format

Test for the significance of difference in lifespan

Maximal Lifespan comparison

Cox proportional hazards regression

Input Format

Ouput Format

Evaluating the effect of several risk factors

References