Categories
Featured Students

AI Month, NSF Fellowship, & Enhancing Language Models

In celebration of AI Month, we are thrilled to extend our congratulations to Andrew Zhu, under the guidance of Chris Callison-Burch, for his remarkable achievement in receiving the prestigious 2024 NSF Graduate Research Fellowship Program (GRFP). Andrew’s dedication and innovative vision have earned him this esteemed recognition, marking a significant milestone in his academic journey.

Andrew’s research endeavors are poised to address a critical challenge in the realm of Language Model Machines or Large Language Models (LLMs), where despite their formidable capabilities, they often grapple with the issue of ‘forgetting’ when confronted with complex inquiries or tasks spanning extensive timelines. For instance, while humans effortlessly navigate through multi-step queries, LLMs encounter hurdles in retaining the original context across successive stages of information retrieval. Inspired by this gap, Andrew’s previous work, FanOutQA, shed light on the limitations of current models, showcasing the vast barrier between human accuracy and machine performance.

High-level diagram showing the settings in the FanOutQA benchmark: the LLM (robot) must answer a complex question by breaking it down (bottom). There are 3 settings we test LLMs in: closed-book (the LLM cannot use any search tools, only what it was trained on), open-book (the LLM may search Wikipedia), and evidence-provided (the LLM is given the Wikipedia pages containing the answers and must extract the correct answers).

Central to Andrew’s proposed solution is the concept of recursive delegation, a groundbreaking approach aimed at endowing LLMs with the ability to allocate intricate tasks to subordinate LLMs. This hierarchical structure allows for the decomposition of complex problems into more manageable segments, with each ‘child’ LLM equipped to handle its designated subtask. Through this cascading delegation, Andrew envisions a synergistic collaboration among LLMs, comparable to a relay race where each participant contributes to achieving the final objective.

In laying the groundwork for his research, Andrew has developed Kani, a versatile framework designed to facilitate seamless interaction between LLMs and human-written code. This innovative tool empowers researchers to experiment with different LLM architectures while interfacing with Python functions effortlessly. By democratizing access to such tools, Andrew aims to catalyze progress in the field and foster a collaborative ecosystem of exploration and discovery.

Screenshot of an early proof-of-concept recursive delegation system I developed this year. In this example, it’s helping make travel plans for a Japan trip and looking at Shinkansen (bullet train) travel times on the internet. Each node (in the top right) is an independent LLM. (green=in progress, yellow=waiting, gray=completed, blue=selected)
The same as above, but with the code needed to actually use Kani. It’s designed to be easy to pick up while allowing experienced developers to customize the library extensively.
An overview of what the Kani system is comprised of. It manages the chat history and provides it and user-written functions to underlying LLMs (engines) with a common interface.

Andrew’s advisor, Chris Callison-Burch, effectively summarizes the significance of his work, stating,

“Andrew Zhu’s research is at the cutting edge of programming languages, natural language processing, and artificial intelligence. His groundbreaking open-source software package, Kani, elegantly solves a major weakness of large language models (LLMs) like ChatGPT by enabling them to write code and call functions. Kani seamlessly integrates LLMs into Python programs, allowing developers to write functions that can be called by the language model. This game-changing approach combines the strengths of AI and traditional programming, enabling LLMs to interweave natural language generation with complex computations.

Andrew’s work has the potential to revolutionize how we develop AI systems, paving the way for more powerful and versatile AI applications that go beyond ChatGPT’s current abilities of understanding and generating natural language to also include the abilities to create and manipulate code.”

Chris Callison-Burch

As Andrew embarks on this ambitious journey, his work holds the promise of transforming the landscape of artificial intelligence, ushering in a new era of collaboration and problem-solving. We eagerly anticipate the insights and advancements that will emerge from his pioneering efforts, underscoring the transformative potential of AI in shaping our collective future. Once again, congratulations to Andrew Zhu on this well-deserved recognition, and here’s to a future marked by innovation, collaboration, and boundless possibilities in the realm of AI.

References:

[1] Andrew Zhu, Alyssa Hwang, Liam Dugan, and Chris Callison-Burch. 2024. FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models. Arxiv preprint, in submission at ACL 2024. https://arxiv.org/abs/2402.14116

[2] Andrew Zhu, Liam Dugan, Alyssa Hwang, and Chris Callison-Burch. 2023. Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications. In Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023), pages 65–77, Singapore. Association for Computational Linguistics. https://aclanthology.org/2023.nlposs-1.8/

Categories
Students

The C.I.S. Blog Presents- “DALL-E Art Gallery”

University of Pennsylvania, by Beeple (prompt generated by Yuxin Meng)

So far in this semester, our department blog has talked about the growths in our labs and our faculty members careers. We have touched on the incredible seminars that have been taking place as well as the exciting research projects that our students are involved in. The blog’s overall purpose is to showcase the technological strides that Penn Engineering is making as well as the significant academic achievements of our faculty and student body. The C.I.S. blog is also a platform that strives to implement humanity and relatability to those who are a part of the Penn community and those outside of the University.

As many of you may know, DALL-E 2 developed by OpenAI was launched in April of this year and has just recently been made available to anyone. You get an allowance of free credits that enable you to type in a detailed description of anything that comes to mind and the machine learning models generate digital images that reflect the prompt. We were interested in the combination between artificial intelligence and the ability to generate realistic images using human ingenuity.

DALL-E is a great example of AI and human collaboration working to break barriers and expand horizons through artistic creativity. This platform also gives people the ability to play and be as out there and imaginative as they want. All in all, DALL-E gives us the opportunity to have fun and explore our hobbies, interests, and studies in the form of art. To showcase this AI system and what it can do I had asked all students from the C.I.S. department, from Undergrad to Ph.D., to send in the descriptions that they prompted and to have fun with it.

With that being said, this Gallery of DALL-E generated art was made possible by some of our wonderful students in the C.I.S. Department!


Ani Petrosyan, she/her
Computer Science major, 2026

Purple mountains with Armenian waterfalls

"I am an international student from Armenia. It's a very mountainous country, with waterfalls and wonderful nature. One fun fact about me: most of the dreams I see have purple color in them, so I am dreaming of my country and seeing it in purple.

Edward Hu, he/him
Ph.D. in Robot Learning

A. Bob Ross in the style of Picasso uncanny unreal engine

"Bob Ross is an iconic painter, so I would like AI painters to pay homage to him."

B. Darth Vader cooking in Hell's Kitchen

"Hell's Kitchen is one of my favorite shows. I think Darth Vader's past with high heat and pressure scenarios would make him an excellent contestant."

Yuxin Meng, she/her
MCIT, 2024

A. Hacker, another dimension, digital art

"This one was intended to be "software engineer..." or "coding in..." but these keywords seemed to be less instructive compared to "hacker". I wanted to see us working on the same thing in another universe."

B. Bionic sheep, blueprints

"Love the book: Do Androids Dream of Electric Sheep? Electric sheep in the book look like real sheep, so I guess bionic is proper."

C. In lab, machine reading brain, codes on computer science, digital, by Beeple

"Not so much what I pictured in my mind. I've been obsessed with Pantheon (science fiction drama) recently. Briefly, I expected a picture of machine scanning human brain as code."

Rotem Dror, she/her
Postdoc in the Cognitive Computation Group

Two computers compete in a running competition

"I needed an image for my job talk presentation that would show two models competing who are going to be state of the art. My research involves developing statistical methods for comparing NLP models to determine which is better."

Hannah Gonzalez, she/her
MSE and BSE in Computer Science, 2023

A. A red fox surfing The Great Wave off Kanagawa by Katsushika Hokusai

B. Huskies sledding in Alaska by Monet

C. Macro 35mm film photography of a floating otter wearing a space suit with the Van Gogh "Starry Night" painted background

D. An Andy Warhol style painting of a corgi winking

Gaoxiang Luo, he/him
Ph.D. in Machine Learning

"I generated an image using my hobbies as keywords: cat, guitar, and latte art. I was very impressed and surprised that the AI considered the cat element as latte art!"