Makine Öğrenmesinde Açıklanabilirlik Kavramı: Derin Pekiştirmeli Öğrenme ve Sembolik Regresyon ile Hibrit Yaklaşım

Volkan Eteman

doi:10.37609/akya.3349

Makine Öğrenmesinde Açıklanabilirlik Kavramı: Derin Pekiştirmeli Öğrenme ve Sembolik Regresyon ile Hibrit Yaklaşım

Yazarlar

Volkan Eteman

https://orcid.org/0000-0002-3430-7073

DOI: https://doi.org/10.37609/akya.3349

Özet

Referanslar

Agostinelli, F., Hocquet, G., Singh, S., & Baldi, P. (2018). From Reinforcement Learning to Deep Reinforcement Learning: An Overview: International Conference Commemorating the 40th Anniversary of Emmanuil Braverman’s Decease, Boston, MA, USA, April 28-30, 2017, Invited Talks (ss. 298-328). https://doi.org/10.1007/978-3-319-99492-5_13

Alagoz, O., Hsu, H., Schaefer, A. J., & Roberts, M. S. (2010). Markov Decision Processes: A Tool for Sequential Decision Making under Uncertainty. Medical decision making : an international journal of the Society for Medical Decision Making, 30(4), 474-483. https://doi.org/10.1177/0272989X09353194

Aldeia, G. S. I., & de França, F. O. (2021). Measuring feature importance of symbolic regression models using partial effects. Proceedings of the Genetic and Evolutionary Computation Conference, 750-758. https://doi.org/10.1145/3449639.3459302

Austel, V., Dash, S., Gunluk, O., Horesh, L., Liberti, L., Nannicini, G., & Schieber, B. (2017). Globally Optimal Symbolic Regression (arXiv:1710.10720). arXiv. https://doi.org/10.48550/arXiv.1710.10720

Aytaç, M. (2015). Markov Karar Analizi. İçinde Karar Verme (1. bs).

Barfuss, W., & Meylahn, J. M. (2023). Intrinsic fluctuations of reinforcement learning promote cooperation. Scientific Reports, 13(1), Article 1. https://doi.org/10.1038/s41598-023-27672-7

Barron, A. B., Hebets, E. A., Cleland, T. A., Fitzpatrick, C. L., Hauber, M. E., & Stevens, J. R. (2015). Embracing multiple definitions of learning. Trends in Neurosciences, 38(7), 405-407. https://doi.org/10.1016/j.tins.2015.04.008

Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE transactions on systems, man, and cybernetics, 5, 834-846.

Basheer, I. A., & Hajmeer, M. (2000). Artificial neural networks: Fundamentals, computing, design, and application. Journal of Microbiological Methods, 43(1), 3-31. https://doi.org/10.1016/S0167-7012(00)00201-3

Bellman, R. (1952). On the Theory of Dynamic Programming. Proceedings of the National Academy of Sciences, 38(8), 716-719. https://doi.org/10.1073/pnas.38.8.716

Bellman, R. (1954). The theory of dynamic programming. Bulletin of the American Mathematical Society, 60(6), 503-515. https://doi.org/10.1090/S0002-9904-1954-09848-8

Bellman, R. (1957). Dynamic Programming (First Edition). Princeton University Press.

Bellman, R. (1961). Adaptive Control Processes: A Guided Tour. Princeton University Press. https://www.jstor.org/stable/j.ctt183ph6v

Bengio, Y. (2007). On the challenge of learning complex functions. İçinde P. Cisek, T. Drew, & J. F. Kalaska (Ed.), Progress in Brain Research (C. 165, ss. 521-534). Elsevier. https://doi.org/10.1016/S0079-6123(06)65033-4

Bengio, Y. (2012). Practical recommendations for gradient-based training of deep architectures. Arxiv.

Boguslavskiy, J. (2016). Polynomial Approximation Technique Applied to Inverse Vector-Function (ss. 45-70). https://doi.org/10.1007/978-3-319-04036-3_4

Borsa İstanbul A.Ş. (2014). https://www.borsaistanbul.com/tr/duyuru/683/viopta-fix-protokolu-ile-emir-iletim-altyapisi-devreye-alindi

Brown, R. (2007). The Lıfe And Work Of Donald Oldıng Hebb, Canada’s Greatest Psychologıst. Proceedings of the Nova Scotian Institute of Science (NSIS), 44. https://doi.org/10.15273/pnsis.v44i1.3880

Brownlees, C. T., Cipollini, F., & Gallo, G. M. (2010). Intra-Daily Volume Modeling and Prediction for Algorithmic Trading (SSRN Scholarly Paper 1393993). https://doi.org/10.2139/ssrn.1393993

Brunton, S. L., & Kutz, J. N. (2019). Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control (1st edition). Cambridge University Press.

Carabantes, M. (2020). Black-box artificial intelligence: An epistemological and critical analysis. AI & SOCIETY, 35(2), 309-317. https://doi.org/10.1007/s00146-019-00888-w

Chaboud, A. P., Chiquoine, B., Hjalmarsson, E., & Vega, C. (2014). Rise of the Machines: Algorithmic Trading in the Foreign Exchange Market. The Journal of Finance, 69(5), 2045-2084. https://doi.org/10.1111/jofi.12186

Clark, W. A., & Farley, B. G. (1955). Generalization of pattern recognition in a self-organizing system. Proceedings of the March 1-3, 1955, western joint computer conference, 86-91. https://doi.org/10.1145/1455292.1455309

De Houwer, J., Barnes-Holmes, D., & Moors, A. (2013). What is learning? On the nature and merits of a functional definition of learning. Psychonomic Bulletin & Review, 20(4), 631-642. https://doi.org/10.3758/s13423-013-0386-3

Domowitz, I., & Yegerman, H. (2006). The Cost of Algorithmic Trading: A First Look at Comparative Performance. The Journal of Trading, 1, 33-42. https://doi.org/10.3905/jot.2006.609174

Ende, B., Gomber, P., & Lutat, M. (2009). Smart Order Routing Technology in the New European Equity Trading Landscape (SSRN Scholarly Paper 1476272). https://papers.ssrn.com/abstract=1476272

European Commission. (2010). Review of the Markets in Financial Instruments Dırective(MIFID).

Goertzel, B. (2014). Artificial General Intelligence: Concept, State of the Art, and Future Prospects. Journal of Artificial General Intelligence, 5(1), 1-48. https://doi.org/10.2478/jagi-2014-0001

Goldberg, D. E. (1989). Genetic Algorithms in Search, Optimization and Machine Learning (1st bs). Addison-Wesley Longman Publishing Co., Inc.

Gomber, P., Arndt, B., Lutat, M., & Uhle, T. (2011). High-Frequency Trading (SSRN Scholarly Paper 1858626). https://doi.org/10.2139/ssrn.1858626

Gomber, P., & Gsell, M. (2006). Catching Up with Technology—The Impact of Regulatory Changes on ECNs/MTFs and the Trading Venue Landscape in Europe. Competition and Regulation in Network Industries, 7, 535-558. https://doi.org/10.1177/178359170600100401

Gomber, P., & Zimmermann, K. (2018). Algorithmic trading in practice. İçinde The Oxford Handbook of Computational Economics and Finance (ss. 311-332). https://doi.org/10.1093/oxfordhb/9780199844371.013.12

Groth, S. S. (2011). Does Algorithmic Trading Increase Volatility? Empirical Evidence from the Fully-Electronic Trading Platform Xetra. Wirtschaftsinformatik. https://www.semanticscholar.org/paper/Does-Algorithmic-Trading-Increase-Volatility-from-Groth/4991b9818bed702f65ca22352fd344d718a2f053

Grundner, A., Beucler, T., Gentine, P., & Eyring, V. (2023). Data-Driven Equation Discovery of a Cloud Cover Parameterization (arXiv:2304.08063). arXiv. https://doi.org/10.48550/arXiv.2304.08063

Gsell, M. (2008). Assessing the impact of algorithmic trading on markets: A simulation approach. 16th European Conference on Information Systems, ECIS 2008.

Gsell, M., & Gomber, P. (2009). Algorithmic Trading Engines Versus Human Traders – Do They Behave Different in Securities Markets? (SSRN Scholarly Paper 1476271). https://papers.ssrn.com/abstract=1476271

Gustafson, S., Burke, E. K., & Krasnogor, N. (2005). On improving genetic programming for symbolic regression. 1, 912-919 Vol.1. https://doi.org/10.1109/CEC.2005.1554780

Harel, G., & Koichu, B. (2010). An operational definition of learning. The Journal of Mathematical Behavior, 29(3), 115-124. https://doi.org/10.1016/j.jmathb.2010.06.002

Hebb, D. O. (1949). The organization of behavior; a neuropsychological theory (ss. xix, 335). Wiley.

Hendershott, T., Jones, C. M., & Menkveld, A. J. (2011). Does Algorithmic Trading Improve Liquidity? The Journal of Finance, 66(1), 1-33. https://doi.org/10.1111/j.1540-6261.2010.01624.x

Hendershott, T., & Riordan, R. (2012). Algorithmic Trading and the Market for Liquidity (SSRN Scholarly Paper 2001912). https://doi.org/10.2139/ssrn.2001912

Ho, Y.-W., Rawat, T., Yang, Z.-K., Pratik, S., Lai, G.-W., Tu, Y.-L., & Lin, A. (2021). Neuroevolution-Based Efficient Field Effect Transistor Compact Device Models. IEEE Access, PP, 1-1. https://doi.org/10.1109/ACCESS.2021.3130254

Holland, J. H. (1992). Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. The MIT Press. https://doi.org/10.7551/mitpress/1090.001.0001

Holt, S., Qian, Z., & van der Schaar, M. (2023). Deep Generative Symbolic Regression (arXiv:2401.00282). arXiv. https://doi.org/10.48550/arXiv.2401.00282

Howard, R. A. (1960). Dynamic programming and Markov processes (ss. viii, 136). John Wiley.

International Securities Commissions. (2011). Regulatory Issues Raised by the Impact of Technological Changes on Market Integrity and Efficiency.

Işığıçok, E. (2015). Karar Vermeye Giriş. İçinde Karar Verme (1. bs).

Jelvis, A. R., Tikhon. (2022). Foundations of Reinforcement Learning with Applications in Finance. Chapman and Hall/CRC. https://doi.org/10.1201/9781003229193

Johnson, B. (2010). Algorithmic trading & DMA: An introduction to direct access trading strategies. 4Myeloma Press.

Josiński, H., Kostrzewa, D., Michalczuk, A., & Świtoński, A. (2014). The Expanded Invasive Weed Optimization Metaheuristic for Solving Continuous and Discrete Optimization Problems. The Scientific World Journal, 2014, 1-14. https://doi.org/10.1155/2014/831691

Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement Learning: A Survey (arXiv:cs/9605103). arXiv. http://arxiv.org/abs/cs/9605103

Khamsi, M. A., & Kirk, W. A. (2001, Mart 6). An Introduction to Metric Spaces and Fixed Point Theory. https://doi.org/10.1002/9781118033074

Kissell, R. (2013). The Science of Algorithmic Trading and Portfolio Management: Applications Using Advanced Statistics, Optimization, and Machine Learning Techniques (1st edition). Academic Press.

Klopf, A. H. (1972). Brain Function and Adaptive Systems: A Heterostatic Theory. Air Force Cambridge Research Laboratories, Air Force Systems Command, United States Air Force.

Koza, J. R. (1994). Genetic programming as a means for programming computers by natural selection. Statistics and Computing, 4(2), 87-112. https://doi.org/10.1007/BF00175355

Kumar, A., Nadeem, M., & Banka, H. (2023). Nature inspired optimization algorithms: A comprehensive overview. Evolving Systems, 14(1), 141-156. https://doi.org/10.1007/s12530-022-09432-6

Lachman, S. J. (1997). Learning is a Process: Toward an Improved Definition of Learning. The Journal of Psychology, 131(5), 477-480. https://doi.org/10.1080/00223989709603535

Lanz, P. (2000). The Concept of Intelligence in Psychology and Philosophy. Içinde H. Cruse, J. Dean, & H. Ritter (Ed.), Prerational Intelligence: Adaptive Behavior and Intelligent Systems Without Symbols and Logic, Volume 1, Volume 2 Prerational Intelligence: Interdisciplinary Perspectives on the Behavior of Natural and Artificial Systems, Volume 3 (ss. 19-30). Springer Netherlands. https://doi.org/10.1007/978-94-010-0870-9_3

Li, C.-C., Liu, J.-C., Lin, C.-H., & Lo, W. (2015). On the Accelerated Convergence of Genetic Algorithm Using GPU Parallel Operations. International Journal of Software Innovation, 3, 1-17. https://doi.org/10.4018/IJSI.2015100101

Lin, L.-J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8(3), 293-321. https://doi.org/10.1007/BF00992699

Matsubara, Y., Chiba, N., Igarashi, R., & Ushiku, Y. (2023). Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery (arXiv:2206.10540). arXiv. https://doi.org/10.48550/arXiv.2206.10540

Méndez-Molina, A., F.Morales, E., & Sucar, L. E. (2022). Causal Discovery and Reinforcement Learning: A Synergistic Integration. Proceedings of The 11th International Conference on Probabilistic Graphical Models, 421-432. https://proceedings.mlr.press/v186/mendez-molina22a.html

Minsky, M. (1954). Theory of neural-analog reinforcement systems and its application to the brain-model problem.

Minsky, M. (1961). Steps toward Artificial Intelligence. Proceedings of the IRE, 49(1), 8-30. https://doi.org/10.1109/JRPROC.1961.287775

Mitchell, M. (1998). An Introduction to Genetic Algorithms. MIT Press.

Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., & Riedmiller, M. (2013). Playing Atari with Deep Reinforcement Learning (arXiv:1312.5602). arXiv. http://arxiv.org/abs/1312.5602

Murphy, J. J. (1999). Technical Analysis of the Financial Markets: A Comprehensive Guide to Trading Methods and Applications. Penguin Publishing Group.

Naeem, M., Rizvi, S. T. H., & Coronato, A. (2020). A Gentle Introduction to Reinforcement Learning and its Application in Different Fields. IEEE Access, 8, 209320-209344. https://doi.org/10.1109/ACCESS.2020.3038605

Nasar, A. A. (2016). The history of Algorithmic complexity. The Mathematics Enthusiast, 13(3), 217-242. https://doi.org/10.54870/1551-3440.1375

Naser, M. Z. (2021). An engineer’s guide to eXplainable Artificial Intelligence and Interpretable Machine Learning: Navigating causality, forced goodness, and the false perception of inference. Automation in Construction, 129, 103821. https://doi.org/10.1016/j.autcon.2021.103821

Nisbett, R. E., Aronson, J., Blair, C., Dickens, W., Flynn, J., Halpern, D. F., & Turkheimer, E. (2012). Intelligence: New findings and theoretical developments. The American Psychologist, 67(2), 130-159. https://doi.org/10.1037/a0026699

Nuti, G., Mirghaemi, M., Treleaven, P., & Yingsaeree, C. (2011). Algorithmic Trading. Computer, 44, 61-69. https://doi.org/10.1109/MC.2011.31

Petersen, B. K., Landajuela, M., Mundhenk, T. N., Santiago, C. P., Kim, S. K., & Kim, J. T. (2021). Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients (arXiv:1912.04871). arXiv. https://doi.org/10.48550/arXiv.1912.04871

Peterson, J. K. (2016). Function Approximation. Içinde J. K. Peterson (Ed.), Calculus for Cognitive Scientists: Derivatives, Integrals and Models (ss. 279-299). Springer. https://doi.org/10.1007/978-981-287-874-8_14

Prix, J., Loistl, O., & Huetl, M. (2007). Algorithmic Trading Patterns in Xetra Orders. European Journal of Finance, 13, 717-739. https://doi.org/10.1080/13518470701705538

Pujol, G., & Brueckner, A. (2009). Smart Order Routing and Best Execution. AMCIS 2009 Proceedings. https://aisel.aisnet.org/amcis2009/155

Raheema, M., & Abdullah, A. (2017). Function Approximation using Neural and Fuzzy Methods.

Raymer, M., Punch, W., Goodman, E., Kuhn, L., & Jain, A. (2000). Dimensionality Reduction Using Genetic Algorithms. Evolutionary Computation, IEEE Transactions on, 4, 164-171. https://doi.org/10.1109/4235.850656

Reiher, W. (1966). Hammersley, J. M., D. C. Handscomb: Monte Carlo Methods. Methuen & Co., London, and John Wiley & Sons, New York, 1964. VII + 178 S., Preis: 25 s. Biometrische Zeitschrift, 8(3), 209-209. https://doi.org/10.1002/bimj.19660080314

Riordan, R. (2009). The Economics of Algorithmic Trading [PhD Thesis, Karlsruhe, Univ., Diss., 2009]. https://scholar.archive.org/work/jczccv3hjnbdziadbfijebktay/access/wayback/https://publikationen.bibliothek.kit.edu/1000014028/1080547

Rosen, K. (2011). Discrete Mathematics and Its Applications Seventh Edition (7th edition). McGraw Hill.

Rosenblatt, F. (1957). The Perceptron: A Perceiving and Recognizing Automaton (85-460-1). https://blogs.umass.edu/brain-wars/files/2016/03/rosenblatt-1957.pdf

Rudin, C., & Radin, J. (2019). Why Are We Using Black Box Models in AI When We Don’t Need To? A Lesson From an Explainable AI Competition. Harvard Data Science Review, 1(2). https://doi.org/10.1162/99608f92.5a8a3a3d

Russell, S., & Norvig, P. (2009). Artificial Intelligence: A Modern Approach (3rd edition). Pearson.

Silver, D. (2015). Lectures on Reinforcement Learning. David Silver. https://www.davidsilver.uk/teaching/

Sun, F., Liu, Y., Wang, J.-X., & Sun, H. (2022). Symbolic Physics Learner: Discovering governing equations via Monte Carlo tree search (arXiv:2205.13134; Versiyon 1). arXiv. http://arxiv.org/abs/2205.13134

Sutton, R. (t.y.). Learning theory support for a single channel theory of the brain.

Sutton, R. (1988). Learning to Predict by the Method of Temporal Differences. Machine Learning, 3, 9-44. https://doi.org/10.1007/BF00115009

Sutton, R., & Barto, A. G. (1981). An Adaptive Network That Constructs and Uses an Internal Model of its World. 4(3), 217-246.

Sutton, R. S., & Barto, A. G. (1981). Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Review, 88(2), 135-170.

Sutton, R. S., & Barto, A. G. (1987). A temporal-difference model of classical conditioning. Proceedings of the ninth annual conference of the cognitive science society, 355-378.

Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press.

Tenachi, W., Ibata, R., & Diakogiannis, F. I. (2023). Deep Symbolic Regression for Physics Guided by Units Constraints: Toward the Automated Discovery of Physical Laws. The Astrophysical Journal, 959(2), 99. https://doi.org/10.3847/1538-4357/ad014c

Tesauro, G. (1995). Temporal Difference Learning and TD-Gammon.

Thrun, S., & Schwartz, A. (1999). Issues in Using Function Approximation for Reinforcement Learning. https://www.semanticscholar.org/paper/Issues-in-Using-Function-Approximation-for-Learning-Thrun-Schwartz/26b8747eb4d7fb4d4fc45707606d5e969b9afb0c

Tiganj, Z., Gershman, S. J., Sederberg, P. B., & Howard, M. W. (2019). Estimating Scale-Invariant Future in Continuous Time. Neural Computation, 31(4), 681-709. https://doi.org/10.1162/neco_a_01171

Treleaven, P., Galas, M., & Lalchand, V. (2013). Algorithmic Trading Review. Communications of the ACM, 56, 76-85. https://doi.org/10.1145/2500117

Udrescu, S.-M., & Tegmark, M. (2021). Symbolic pregression: Discovering physical laws from distorted video. Physical Review E, 103(4), 043307. https://doi.org/10.1103/PhysRevE.103.043307

US: Securities and Exchange Commission. (2010). Concept Release on Equity Market Structure. Part III: Proposed Rules. Federal Register. Vol. 75, No. 13, 3331-3614.

Wang, Y., Wagner, N., & Rondinelli, J. M. (2019). Symbolic regression in materials science. MRS Communications, 9(3), 793-805. https://doi.org/10.1557/mrc.2019.85

Watkins, C. (1989). Learning From Delayed Rewards.

Watkins, C., & Dayan, P. (1992). Technical Note: Q-Learning. Machine Learning, 8, 279-292. https://doi.org/10.1007/BF00992698

Whiteson, S. (2010). Adaptive Representations for Reinforcement Learning (C. 291). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-13932-1

Wiering, M. (1999). Explorations in efficient reinforcement learning. https://www.semanticscholar.org/paper/Explorations-in-efficient-reinforcement-learning-Wiering/d5a978148cb0ea702ec0671260786e6b03fd2a39

Wiering, M., & van Otterlo, M. (2014). Reinforcement Learning: State-of-the-Art. Springer Publishing Company, Incorporated.

Wu, Y. M. (2012). The Comparison of Different Chromosome Coding Method Based on Genetic Algorithm. Advanced Materials Research, 546-547, 666-669. https://doi.org/10.4028/www.scientific.net/AMR.546-547.666

Zhang, M., Kim, S., Lu, P. Y., & Soljačić, M. (2023). Deep Learning and Symbolic Regression for Discovering Parametric Equations (arXiv:2207.00529). arXiv. http://arxiv.org/abs/2207.00529

İndir

PDF

Satın almak istiyorum

Yayınlanan

22 Kasım 2024

Lisans

Bu İnternet Sitesi içeriğinde yer alan tüm eserler (yazı, resim, görüntü, fotoğraf, video, müzik vb.) Akademisyen Kitabevine ait olup, 5846 sayılı Fikir ve Sanat Eserleri Kanunu ve 5237 sayılı Türk Ceca Kanunu kapsamında korunmaktadır. Bu hakları ihlal eden kişiler, 5846 sayılı Fikir ve Sanat eserleri Kanunu ve 5237 sayılı Türk Ceza Kanununda yer alan hukuki ve cezai yaptırımlara tabi olurlar. Yayınevi ilgili yasal yollara başvurma hakkına sahiptir.

Mevcut yayın biçimiyle ilgili ayrıntılar: PDF

PDF

ISBN-13 (15)

978-625-375-140-1

Publication date (01)

2024

Makine Öğrenmesinde Açıklanabilirlik Kavramı: Derin Pekiştirmeli Öğrenme ve Sembolik Regresyon ile Hibrit Yaklaşım

Yazarlar

Özet

Referanslar

İndir

Yayınlanan

Lisans

Mevcut yayın biçimiyle ilgili ayrıntılar: PDF

ISBN-13 (15)

Publication date (01)

Fiziksel Boyutlar