Nox Vahn & Marsh | Come Together

Meditative Pixels | Unraveling Screen and AI Obsession in a Meta Lyric Music Video Featuring Alan Watts from Nox Vahn & Marsh’s ‘Come Together’

“Meditative Pixels” aims to be a thought-provoking and innovative electronic music video that critiques how we consume content on screens and our growing obsession with AI-generated content.

Watch here on Instagram

Initially conceived as a project for a Time-Based Design class at Parsons, the video evolved from an assignment focused on kinetic text into a unique, multi-layered exploration of screen culture and AI. Featuring Nox Vahn & Marsh’s track “Come Together,” which masterfully remixes an excerpt from Alan Watts’ insightful lecture “Getting Into the Meditative State” into an immersive electronic soundscape, the video employs AI-generated visuals to create a captivating experience.

The video takes viewers on a journey through the evolution of screen consumption, starting with karaoke machines, which popularized displaying lyrics on-screen.

As the video progresses, we transition to the era of desktop computers, where lyric videos gained traction on platforms like YouTube.

Finally, we arrive at the present day, where mobile screens dominate our lives and have become the primary medium for consuming content.

“Throughout the video, AI-generated imagery is used to create the visuals, reflecting our growing fascination with artificial intelligence and its potential to shape the content we consume. The remixed lyrics of Alan Watts’ lecture are presented in a way that both parodies and critiques the conventions of lyric music videos, juxtaposing Watts’ profound message about meditation and the interconnectedness of the inner and outer world with the ever-changing screens and AI-generated content.”

“Meditative Pixels” aims to challenge viewers to question their relationship with screens, AI-generated content, and how they consume media. By integrating Alan Watts’ powerful insights with Nox Vahn & Marsh’s captivating electronic track, AI-generated visuals, and a critique of the evolution of screens, “Meditative Pixels” aims to offer a unique and engaging perspective on the role of screens and AI in our lives and the impact they have on our understanding of music, spoken word, and ourselves. Personally, the video serves as a reminder to step back, reevaluate our screen habits and fascination with AI, and focus on the message rather than the medium.

AI Image-Generated Content:

In creating the “Meditative Pixels” music video, I tried an innovative approach to generating captivating visuals by utilizing the outpainting capabilities of OpenAI’s advanced generative model, DALL-E 2. The process began with sourcing images from the platform Midjourney, which provided the base content for the AI-powered extrapolation.

DALL-E 2, a powerful generative model, combines extensive training data with natural language understanding to analyze visual structures, styles, and patterns. By providing a text prompt that describes the desired output, DALL-E 2 can generate images that extend or modify the original image while maintaining visual coherence.

“The outpainting process involves selecting an image to extend or modify, creating a larger canvas with the original image inside, and preparing a text prompt that describes the desired output. Certain elements or areas can be erased or masked from the original image using image editing tools before outpainting, allowing the AI model to generate new content in those areas based on the prompt.

With the prepared image and prompt, DALL-E 2 generates outpainted frames that extend or modify the content accordingly. The generated frames are then reviewed and refined, ensuring they align with the creator’s vision. Finally, the AI-generated content is seamlessly blended with the original image using image editing software, resulting in a visually appealing final piece.”

— ChatGPT

For instance, one of the images sourced from Midjourney depicted people singing karaoke in Japan. To extend this image using DALL-E 2’s outpainting feature, the prompt “1960s Japan, vintage, women singing karaoke with a karaoke machine in the background” was provided. This prompted the AI model to generate an extrapolated image that expanded the scene while preserving the elements of the original image, adding a 1960s vintage aesthetic, and placing a karaoke machine in the background.

More by Joey Foster Ellis

View profile