Articles | Open Access |

The Sonic and Textual Architecture of Financial Markets: A Multimodal Deep Learning Analysis of Executive Affect in Corporate Disclosures

Julian Antonelli , Department of Computational Finance, National University of Singapore, Singapore

Abstract

This research investigates the complex interplay between verbal and non-verbal communication in the context of corporate financial disclosures, specifically focusing on earnings conference calls and SEC filings. Historically, financial analysis has relied heavily on the quantitative data found in balance sheets; however, this study posits that the emotional state and vocal delivery of executives provide a significant, under-explored signal to the market. By synthesizing principles from communication theory, psychology, and advanced deep learning, the paper develops a multimodal sentiment analysis framework. We examine the "ripple effect" of emotional contagion and its influence on investor behavior, utilizing phonetic analysis tools to decode paralinguistic cues such as pitch, hesitation, and emphasis. The study further integrates web mining techniques to map innovation ecosystems and evaluate how distance from innovation hubs affects AI adoption and disclosure quality. Our findings suggest that textual sentiment in SEC filings, when moderated by the vocal affect of CEOs during live calls, significantly influences post-earnings announcement drift and analyst recommendations. The integration of deep learning models for predictive innovation mapping reveals that "noise" on internet message boards often masks high-value emotional signals. This comprehensive framework offers a new paradigm for understanding market efficiency through the lens of emotive communication pragmatics and automated sentiment extraction.

Keywords

Multimodal Sentiment Analysis, Earnings Calls, Financial Disclosures, Deep Learning

References

Andersen, Peter A., and Laura K. Guerrero. 1998. Principles of communication and emotion in social interaction, in P.A. Andersen, and L.K. Guerrero, eds. Handbook of Communication and Emotion: Research, Theory, Applications, and Contexts (Academic Press, San Diego, CA).

Antweiler, Werner, and Murray Z. Frank. 2004. Is all that talk just noise? The information content of internet stock message boards? Journal of Finance 59, 1259–1294.

Arnold, Magda B. 1960. Emotion and Personality (Columbia University Press, New York, NY).

Barsade, Sigal G. 2002. The ripple effect: Emotional contagion and its influence on group behavior, Administrative Science Quarterly 47, 644–675.

Bernard, Victor, and Jacob Thomas. 1989. Post earnings announcement drift: Delayed price response or risk premium? Journal of Accounting Research (Supplement), 1–36.

Bligh, Michelle C., and Gregory D. Hess. 2007. The power of leading subtly: Alan Greenspan, rhetorical leadership, and monetary policy, The Leadership Quarterly 18, 87–104.

Boersma, Paul, and David Weenink. 2010. PRAAT: Doing phonetics by computer (version 5.2.05) [Computer program], retrieved from http://www.praat.org/ on December 13, 2010.

Bradshaw, Mark. 2004. How do analysts use their earnings forecasts in generating stock recommendations The Accounting Review 79, 25–50.

Brown, Troy E., Stuart M. Senter, and Andrew H. Ryan, Jr. 2003. Ability of the Vericator to detect smugglers at a mock security checkpoint, Department of Defense Polygraph Institute report number DoDPI03-R-0002.

Bushee, Brian, Michael Jung, and Gregory Miller. 2011. Conference presentations and the disclosure milieu, Journal of Accounting Research 49, 1163–1192.

Cacioppo, John T., and Gary G. Bernston. 1994. Relationship between attitudes and evaluative space: A critical review, with emphasis on the separability of positive and negative substrates, Psychological Bulletin 115, 401–423.

Caffi, Claudia, and Richard W. Janney. 1994. Toward a pragmatics of emotive communication, Journal of Pragmatics 22, 325–373.

Hunt, J., Cockburn, I., & Bessen, J. (2024). Is distance from innovation a barrier to the adoption of artificial intelligence? Technical Report No. w33022 (2025-04-14). National Bureau of Economic Research. https://doi.org/10.3386/w33022

Jonsson, A., & Svingby, G. (2007). The use of scoring rubrics: Reliability, validity and educational consequences. Educational Research Review, 22, 130–144. https://doi.org/10.1016/j.edurev.2007.05.002

Kinne, J., & Axenbeck, J. (2020). Web mining for innovation ecosystem mapping: A framework and a large-scale pilot study. Scientometrics, 125(3), 2011–2041. https://doi.org/10.1007/s11192-020-03726-9

Kinne, J., & Lenz, D. (2021). Predicting innovative firms using web mining and deep learning. PLoS ONE, 164, e0249071. https://doi.org/10.1371/journal.pone.0249071

Koster, M., Illyes, G., Zeller, H., & Sassman, L. (2022). Robots exclusion protocol. RFC Editor. https://doi.org/10.17487/RFC9309

Landis, J.-R., & Koch, G.-G. (1977). The measurement of observer agreement for categorical data. Biometrics, 331, 159. https://doi.org/10.2307/2529310

Lee, M.-C., Scheepers, H., Lui, A.-K., & Ngai, E.-W. (2023). The implementation of artificial intelligence in organizations: A systematic literature review. Information and Management, 605, 103816. https://doi.org/10.1016/j.im.2023.103816

Priyank Tailor, & Anjali Kale. (2025). Multimodal Sentiment Analysis of Earnings Calls and SEC Filings: A Deep Learning Approach to Financial Disclosures. Utilitas Mathematica, 122(1), 3163–3168. Retrieved from https://utilitasmathematica.com/index.php/Index/article/view/2676

Article Statistics

Downloads

Download data is not yet available.

Copyright License

Download Citations

How to Cite

Julian Antonelli. (2025). The Sonic and Textual Architecture of Financial Markets: A Multimodal Deep Learning Analysis of Executive Affect in Corporate Disclosures. International Journal of Computer Science & Information System, 10(12), 74–81. Retrieved from http://scientiamreearch.org/index.php/ijcsis/article/view/358