Let’s Practice Better... on Cats: Description and Visualisation of Artistic Images in Generative AI Models
pdf (Russian)
html (Russian)

Keywords

Digital Art; Generative Models; Neural Networks; Artistic Image; Visualisation; Socio-Cultural Context; Prompt; DALL·E; Stable Diffusion; Kandinsky

How to Cite

Khandogin, R., & Proner, N. (2024). Let’s Practice Better... on Cats: Description and Visualisation of Artistic Images in Generative AI Models. Galactica Media: Journal of Media Studies, 6(4), 160–193. https://doi.org/10.46539/gmd.v6i4.554

Abstract

Artificial Intelligence (AI) plays an increasingly prominent role in many spheres of life today, including the generation of visual content, from selfie stream processing to the creation of works of digital art. The present paper asks whether AI is capable of creating real art or merely imitates its external form. The paper examines the specificity of prompts, from concrete named ones to interpretive descriptive queries, in linguistic, artistic and socio-cultural contexts. The article considers important aspects of evaluating the quality of keyword extraction algorithms and their relation to artistic practice. The authors rely on semiotic analysis to uncover the meanings and significance encoded in the text. The article emphasises that the literary text is at the top of the hierarchy of cultural texts; it is characterised by intentionality and coherence and represents a complex semantic field where key words and images interact with explicit and implicit contexts. The study analyses visualised images of the Cheshire Cat, the cat Behemoth and Tomcat Murr that the authors created with three generative neural networks: Stable Diffusion, DALL·E and Kandinsky. For generative systems and models implementing specific algorithms, understanding and visualising a literary text requires the ability to reveal its multilayered semantics and its connection with the cultural context, which ultimately helps to understand the deeper meanings of the work and its place in culture. The operational quality of algorithms for keyword extraction and image generation can be considered from the point of view of their structural organisation. Generative algorithms create an imitative reality, while the immanence of artistic value determines the uniqueness and meanings of the created figurative world.
The article may be useful to anyone interested in the substance and specificity of digital art, the relationship between technological innovation and socio-cultural context, and the creation, visualisation, conceptualisation and interpretation of artistic images in generative AI models.

https://doi.org/10.46539/gmd.v6i4.554

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.