帳號:guest(18.218.2.231)          離開系統
字體大小: 字級放大   字級縮小   預設字形  

詳目顯示

以作者查詢圖書館館藏以作者查詢臺灣博碩士論文系統以作者查詢全國書目
作者(中文):廖翊廷
作者(外文):Liao, Yi-Ting
論文名稱(中文):核密度估計之距離相關係數
論文名稱(外文):Distance Correlations of Kernel Density Estimators
指導教授(中文):黃禮珊
指導教授(外文):Huang, Li-Shan
口試委員(中文):林千代
江金倉
學位類別:碩士
校院名稱:國立清華大學
系所名稱:統計學研究所
學號:111024518
出版年(民國):113
畢業學年度:112
語文別:中文
論文頁數:61
中文關鍵詞:距離共變異數核密度估計相關係數非線性相關性距離相關係數距離變異數
外文關鍵詞:distance covariancekernel density estimatorcorrelationnonlinear correlationdistance correlationdistance variance
相關次數:
  • 推薦推薦:0
  • 點閱點閱:17
  • 評分評分:*****
  • 下載下載:0
  • 收藏收藏:0
本論文主要探討核密度估計方法下距離相關係數的應用,並對其進行模擬與實際數據驗證。距離相關係數是一種用於衡量兩組多維隨機變數之間相關性的統計指標。傳統的皮爾森相關係數只能檢測線性關係,而距離相關係數則能夠檢測到更廣泛的非線性相關。核密度估計是一種無母數曲線估計方法,通過選取欲估計點周圍的樣本進行曲線配適,距離離估計點越近的樣本權重越大。核密度估計的核心概念在於利用加權過後的樣本來估計機率密度函數。本論文使用高斯核函數進行估計,並將其結果用於估計變數的聯合機率密度函數及邊際機率密度函數,從而得到估計後的聯合特徵函數及邊際特徵函數。本論文首先對距離相關係數進行了理論探討,並定義了相關的數學公式。接著,進行了一系列模擬實驗,包括生成高度相關及低度相關的數據,並計算出核密度估計距離相關係數、樣本距離相關係數與皮爾森相關係數進行比較。模擬結果顯示,與距離相關係數的統計指標在檢測非線性相關方面具有優越性。最後,將核密度估計距離相關係數應用於實際數據,驗證其在不同資料集上的適用性。
This thesis primarily explores the application of the distance correlation coefficient for kernel density estimators (KDE), with simulations and real data applications. The distance correlation coefficient is a statistical measure used to assess the association between two sets of multidimensional random variables. Unlike the traditional Pearson correlation coefficient, which can only detect linear relationships, the distance correlation coefficient can identify a broader range of nonlinear associations. KDE is a non-parametric density estimation method that fits a curve using data around the point to be estimated, with closer data points receiving more weight. This study employs Gaussian kernel functions for estimation and uses the results to estimate the joint and marginal probability density functions of the variables, thereby obtaining the estimated joint characteristic function and marginal characteristic function. The thesis first theoretically reviews the distance correlation coefficient and defines the relevant mathematical notation. Subsequently, a series of simulation experiments are conducted, including generating data with high and low correlations, and calculating the KDE-based distance correlation coefficients, the sample distance correlation coefficients, and the Pearson correlation coefficients for comparison. The simulation results demonstrate that the distance correlation coefficient is superior in detecting nonlinear correlations. Finally, the KDE-based distance correlation coefficient is applied to mortality data to verify its applicability across different datasets. (The Chinese abstract was written by myself and translated into English version using ChatGPT. Both my professor and I have revised the translated abstract.)
目錄
1 介紹 4
1.1 前言 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 研究動機 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2 背景、定義及定理 5
2.1 核函數估計 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 距離相關係數 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3 方法 9
3.1 KDE估計性質與其特徵函數 . . . . . . . . . . . . . . . . . . . . . . 9
3.2 利用二維核密度估計計算的距離共變異數 . . . . . . . . . . . . . . 12
4 模擬 15
4.1 模擬步驟介紹 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4.2 模擬結果比較 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
4.3 統計量的表現 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
5 實際數據應用 52
6 結論 60
參考文獻 61
Billingsley, P. (1986), Probability and Measure, second edn, John Wiley and Sons.
de Laplace, P.-S. M. (1820), Th ́eorie analytique des probabilit ́es, Oeuvres compl`etes, Courcier.
Duong, T. (2007), ‘ks: Kernel density estimation and kernel discriminant analysis for multivariate data in r’, Journal of Statistical Software 21(7), 1–16.
Herbert, R. (1956), An empirical bayes approach to statistics, in ‘Proceedings of the third berkeley symposium on mathematical statistics and probability’, Vol. 1, pp. 157–163.
Parzen, E. (1962), ‘On estimation of a probability density function and mode’, The Annals of Mathematical Statistics 33(3), 1065–1076.
Pearson, K. (1901), ‘Liii. on lines and planes of closest fit to systems of points in space’, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2(11), 559–572.
Rosenblatt, M. (1956), ‘Remarks on Some Nonparametric Estimates of a Density Function’, The Annals of Mathematical Statistics 27(3), 832 – 837.
Sklar, M. (1959), ‘Fonctions de r ́epartition `a n dimensions et leurs marges’, Publ. Inst. Statist. Univ. Paris 8, 229–231.
Sz ́ekely, G. J. and Rizzo, M. L. (2023), The Energy of Data and Distance Correlation (1st ed.), Chapman and Hall/CRC.
Sz ́ekely, G. J., Rizzo, M. L. and Bakirov, N. K. (2007), ‘Measuring and testing dependence by correlation of distances’, The Annals of Statistics 35(6), 2769 – 2794
(此全文20290730後開放外部瀏覽)
電子全文
摘要
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
* *