Ball, P. (2000). In P. Ball, H. F. Spirer, & L. Spirer (Eds.), Making the Case: Investigating Large Scale Human Rights Violations Using Information Systems and Data Analysis. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A method for calibrating false-match rates in record linkage. Journal of the American Statistical Association, 90(430), 694–707.
Bilenko, M., & Mooney, R. J. (2003). Adaptive Duplicate Detection Using Learnable String Similarity Measures. In KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Record Linkage Using Seeded Nearest Neighbor and Support Vector Machine Classification. In KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering, 24(9), 1537–1555.
Cohen, W., Ravikumar, P., & Fienberg, S. (2003). A comparison of string metrics for matching names and records. In KDD workshop on data cleaning and object consolidation (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Record linkage: Statistical models for matching computer records. Journal of the Royal Statistical Society, Series A, 153(3), 287–320.
Dai, A. M., & Storkey, A. J. (2011). The grouped author-topic model for unsupervised entity resolution. In Artificial neural networks and machine learning–ICANN 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, A., & Scanu, M. (2001). On Bayesian Record Linkage. Research in Official Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, A. (2013). A Bayesian procedure for file linking to analyze end-of-life medical costs. Journal of the American Statistical Association, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Exploration Mining in Diabetic Patients Databases: Findings and Conclusions. In KDD ’00 (pp. 430–436). ACM.
A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model
Jewell, N. P., Spagat, M., & Jewell, B. L. (2013). MSE and Casualty Counts: Assumptions, Interpretation, and Challenges. In T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An Introduction to Recording and Estimating Nonmilitary Deaths in Conflict. Oxford, UK: Oxford University Press.
Larsen, M. D. (2002). Comments on Hierarchical Bayesian Record Linkage. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 1995–2000). The American Statistical Association.
Larsen, M. D. (2005). Advances in Record Linkage Theory: Hierarchical Bayesian Record Linkage Theory. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 3277–3284). The American Statistical Association.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automated record linkage using mixture models. Journal of the American Statistical Association, 96(453), 32–41.
Lum, K., Price, M. E., & Banks, D. (2013). Applications of Multiple Systems Estimation in Human Rights Research. The American Statistician, 67(4), 191–200.
Marchant, N. G., Steorts, R. C., Kaplan, A., Rubinstein, B. I. P., & Elazar, D. N. (2019). d-blink: Distributed end-to-end Bayesian entity resolution.
McCallum, A., & Wellner, B. (2004). Conditional Models of Identity Uncertainty with Application to Noun Coreference. In Advances in neural information processing systems (NIPS ’04) (pp. 905–912). MIT Press.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A Domain-Specific Tool for the Deduplication of Vaccination History Records in Childhood Immunization Registries. Computers and Biomedical Research, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Measuring and Maximizing Coverage in the World Trade Center Health Registry. Statistics in Medicine, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and deduplication after indexing, blocking, and filtering. Journal of Privacy and Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A. P. (1959). Automatic linkage of vital records: Computers can be used to extract "follow-up" statistics of families from files of routine records. Science, 130(3381), 954–959.
Sadinle, M. (2014). Detecting Duplicates in a Homicide Registry Using a Bayesian Partitioning Approach. Annals of Applied Statistics, 8(4), 2404–2434.
Sariyar, M., Borg, A., & Pommerening, K. (2012). Active Learning Strategies for the Deduplication of Electronic Patient Data Using Classification Trees. Journal of Biomedical Informatics, 45(5), 893–900.
Steorts, R. C., Hall, R., & Fienberg, S. E. (2016). A Bayesian Approach to Graphical Record Linkage and Deduplication. Journal of the American Statistical Association, 111(516), 1660–1672.
Tancredi, A., & Liseo, B. (2011). A hierarchical Bayesian approach to record linkage and population size problems. Annals of Applied Statistics, 5(2B), 1553–1585.