Session Title
Session 3-3-C: Research Perspectives
Presentation Type
Paper Presentation
Location
Park MGM, Las Vegas, NV
Start Date
25-5-2023 1:30 PM
End Date
25-5-2023 3:00 PM
Disciplines
Applied Statistics | Data Science | Numerical Analysis and Scientific Computing | Statistical Methodology
Abstract
Abstract:
A common difficulty when researching gambling topics is the availability of high-quality data sets for development and testing. Due to the high level of secrecy within the gambling industry, if data is obtained for research purposes it is often prohibitively obfuscated, incomplete, or aggregated. Although these data have allowed for advancement in academic work, it leaves both the researchers and readers left wondering about what would be possible if more detailed data sets were available. To mitigate the paucity of data available to researchers, we present a Markov chain-based statistical process for producing artificial event data for a simulated slot floor, which is realistic and statistically similar to the real source data used to generate it. Methods based on a transition probability matrix estimation are introduced and tested on an anonymous source data set. The process can accurately replicate event data and resultant session data distributions well, producing a robust artificial data set that can be used for research purposes.
Implication Statement:
Lack of detailed gambling data impedes both research and the rapid development of new gambling technologies by students, researchers, and entrepreneurs. The methods presented offer a solution by creating complete, statistically robust artificial slot data that can be used for research and development.
Keywords
Casinos, slot floor, data generation, artificial data, simulations, Markov chains
Funding Sources
This work was supported by funding from nQube Data Science Inc., Mitacs, and the University of Manitoba.
Competing Interests
Courtney Bonner - There are no competing interests. Jason Fiege and Anastasia Baran - In the last three years nQube has received funding from The National Research Council of Canada Industrial Research Assistance Program (NRC IRAP) for work completely unrelated to this research. There are no competing interests. Saman Muthukumarana - There are no competing interests.
Included in
Applied Statistics Commons, Data Science Commons, Numerical Analysis and Scientific Computing Commons, Statistical Methodology Commons
Statistical methods to generate artificial slot floor data for the advancement of casino related research
Park MGM, Las Vegas, NV
Abstract:
A common difficulty when researching gambling topics is the availability of high-quality data sets for development and testing. Due to the high level of secrecy within the gambling industry, if data is obtained for research purposes it is often prohibitively obfuscated, incomplete, or aggregated. Although these data have allowed for advancement in academic work, it leaves both the researchers and readers left wondering about what would be possible if more detailed data sets were available. To mitigate the paucity of data available to researchers, we present a Markov chain-based statistical process for producing artificial event data for a simulated slot floor, which is realistic and statistically similar to the real source data used to generate it. Methods based on a transition probability matrix estimation are introduced and tested on an anonymous source data set. The process can accurately replicate event data and resultant session data distributions well, producing a robust artificial data set that can be used for research purposes.
Implication Statement:
Lack of detailed gambling data impedes both research and the rapid development of new gambling technologies by students, researchers, and entrepreneurs. The methods presented offer a solution by creating complete, statistically robust artificial slot data that can be used for research and development.