MARC Record from marc_columbia

Record ID	marc_columbia/Columbia-extract-20221130-032.mrc:13002180:3425
Source	marc_columbia
Download Link	/show-records/marc_columbia/Columbia-extract-20221130-032.mrc:13002180:3425?format=raw

LEADER: 03425cam a22003733i 4500
001 15539211
005 20210729220235.0
006 m o d
007 cr |n||||a||||
008 210707s2021 nyu|||| om 00| ||eng d
035 $a(OCoLC)1261044879
035 $a(OCoLC)on1261044879
035 $a(NNC)ACfeed:legacy_id:ac:2rbnzs7h89
035 $a(NNC)ACfeed:doi:10.7916/d8-k9kk-5e95
035 $a(NNC)15539211
040 $aNNC$beng$erda$cNNC
100 1 $aWang, Zhi.
245 10 $aStatistical Learning for Process Data /$cZhi Wang.
264 1 $a[New York, N.Y.?] :$b[publisher not identified],$c2021.
336 $atext$btxt$2rdacontent
337 $acomputer$bc$2rdamedia
338 $aonline resource$bcr$2rdacarrier
300 $a1 online resource.
502 $aThesis (Ph.D.)--Columbia University, 2021.
500 $aDepartment: Statistics.
500 $aThesis advisor: Jingchen Liu.
500 $aThesis advisor: Zhiliang Ying.
520 $aComputer-based tests facilitate the collection of problem-solving processes, also known as process data. Response processes recorded in computer log files provide a new venue for investigating and understanding human behaviors. This thesis focuses on the development of statistical learning methods for process data and considers the following three problems. The first problem is feature extraction. Response processes are noisy and of non-standard formats. To exploit information in process data, we propose two generic methods that summarize response processes to vectors so that standard statistical tools such as regression models are applicable. In Chapter 2, features are extracted using multidimensional scaling and a pairwise dissimilarity measure of response processes. Chapter 3 utilizes autoencoder and recurrent neural network to explore the latent structure of process data. For both methods, empirical studies show that the extracted features preserve a substantial amount of information in the observed processes and have greater predictive power for many variables than the traditional item responses.
520 $aThe second problem is assessment based on process data. We present a statistical procedure in Chapter 4 that incorporates process information to improve the latent trait estimation of item response theory models. The procedure is data-driven and can be easily implemented by means of regression models. Theoretical guarantee is established for the mean squared error reduction. Application of this new process-data-based estimator to a real dataset shows that it achieves higher reliability than the traditional item-response-theory-based estimator. The third problem is identification of problem-solving strategies for exploratory analysis. The approach presented in Chapter 5 segments individual process into a sequence of more homogeneous subprocesses using action predictability. Each subprocess is associated with a subtask whereby long and complex response process can be transformed into shorter and more interpretable subtask sequence. Using this approach, problem-solving strategies can be visualized and compared among groups of respondents and process information can be decomposed for further analysis.
653 0 $aStatistics
653 0 $aComputer-assisted instruction
653 0 $aProblem solving
856 40 $uhttps://doi.org/10.7916/d8-k9kk-5e95$zClick for full text
852 8 $blweb$hDISSERTATIONS