The SADI package provides tools for sequence analysis, which focuses on the similarity and dissimilarity between categorical time series such as life-course trajectories. SADI‘s main components are tools to calculate intersequence distances using several different algorithms, including the optimal matching algorithm, but it also includes utilities to graph, summarize, and manage sequence data. It provides similar functionality to the R package TraMineR and the Stata package SQ but is substantially faster than the latter.
AbbottA., and ForrestJ.1986. Optimal matching methods for historical sequences. Journal of Interdisciplinary History16: 471–494.
2.
BargemanB., JohC.-H., and TimmermansH.2002. Vacation behavior using a sequence alignment method. Annals of Tourism Research29: 320–337.
3.
BlanchardP., BühlmannF., and GauthierJ.-A., eds. 2014. Advances in Sequence Analysis: Theory, Method, Applications.Berlin: Springer.
4.
Brzinsky-FayC., KohlerU., and LuniakM.2006. Sequence analysis with Stata. Stata Journal6: 435–460.
5.
CornwellB.2015. Social Sequence Analysis: Methods and Applications.New York: Cambridge University Press.
6.
ElzingaC. H.2007. Sequence analysis: Metric representations of categorical time series. Technical report, Department of Social Science Research Methods, Vrije Universiteit, Amsterdam.
7.
ElzingaC. H., and LiefbroerA. C.2007. De-standardization of family-life trajectories of young adults: A cross-national comparison using sequence analysis. European Journal of Population23: 225–250.
8.
ForestierG., LalysF., RiffaudL., TrelhuB., and JanninP.2012. Classification of surgical processes using dynamic time warping. Journal of Biomedical Informatics45: 255–264.
9.
GabadinhoA., RitschardG., StuderM., and MüllerN. S.2009. Mining sequence data in R with the TraMineR package: A user's guide for version 1.2. Technical report, Department of Econometrics and Laboratory of Demography, University of Geneva, Switzerland.
10.
HalpinB.2010. Optimal matching analysis and life-course data: The importance of duration. Sociological Methods and Research38: 365–388.
11.
HalpinB.2012. Sequence analysis of life-course data: A comparison of distance measures. Working Paper WP2012-02, Department of Sociology, University of Limerick. http://www.ul.ie/sociology/pubs/wp2012-02.pdf.
12.
HalpinB.2013. Sequence analysis. In Oxford Bibliographies in Sociology, ed. BaxterJ.New York: Oxford University Press.
13.
HalpinB.2014. Three narratives of sequence analysis. In Advances in Sequence Analysis: Theory, Method, Applications, ed. BlanchardP., BühlmannF., and GauthierJ.-A., 75–103. Berlin: Springer.
14.
HalpinB.2016a. Cluster analysis stopping rules in Stata. Working Paper WP2016-01, Department of Sociology, University of Limerick. https://osf.io/rjqe3.
15.
HalpinB.2016b. Multiple imputation for categorical time series. Stata Journal16: 590–612.
16.
HalpinB., and ChanT. W.1998. Class careers as sequences: An optimal matching analysis of work-life histories. European Sociological Review14: 111–130.
17.
HollisterM.2009. Is optimal matching suboptimal?Sociological Methods and Research38: 235–264.
18.
HubertL., and ArabieP.1985. Comparing partitions. Journal of Classification2: 193–218.
19.
JannB.2005. moremata: Stata module (Mata) to provide various functions. Statistical Software Components S455001, Department of Economics, Boston College. https://ideas.repec.org/c/boc/bocode/s455001.html.
20.
LesnardL.2008. Off-scheduling within dual-earner couples: An unequal and negative externality for family time. American Journal of Sociology114: 447–490.
MarteauP.-F.2009. Time warp edit distance with stiffness adjustment for time series matching. IEEE Transactions on Pattern Analysis and Machine Intelligence31: 306–318.
23.
MartinP., SchoonI., and RossA.2008. Beyond transitions: Applying optimal matching analysis to life course research. International Journal of Social Research Methodology11: 179–199.
24.
McVicarD., and Anyadike-DanesM.2002. Predicting successful and unsuccessful transitions from school to work by using sequence methods. Journal of the Royal Statistical Society, Series A165: 317–334.
25.
RaabM., FasangA. E., KarhulaA., and ErolaJ.2014. Sibling similarity in family formation. Demography51: 2127–2154.
26.
ReillyC., WangC., and RutherfordM.2005. A rapid method for the comparison of cluster analyses. Statistica Sinica15: 19–33.
27.
StuderM., and RitschardG.2016. What matters in differences between life trajectories: A comparative review of sequence dissimilarity measures. Journal of the Royal Statistical Society, Series A179: 481–511.
28.
StuderM., RitschardG., GabadinhoA., and MüllerN. S.2011. Discrepancy analysis of state sequences. Sociological Methods and Research40: 471–510.
29.
VinhN. X., EppsJ., and BaileyJ.2009. Information theoretic measures for clusterings comparison: Is a correction for chance necessary? In Proceedings of the Twenty-Sixth International Conference on Machine Learning, ed. BottouL., and LittmanM., 1073–1080. Montreal, Canada: IMLS.