“Help people keep in touch informed … It’s my motivation”

Once in time, we could certainly draw on the information threads around us and weave them to useful knowledge because the higher quality threads tended to stand out. Today, when we sweep with an Information Tsunami, it can be difficult to know what to achieve, what information to trust. Amazon Scholar Heng Ji, professor of computer science at the University of Illinois Urban-Champaign (UIUC), has made it the work of his life to help us separate the signal from noise.

Amazon Scholar Heng Ji Leads Blender Lab, where she seeks to promote a future where computers will be able to distinguish precisely, successful and reliable knowledge.

“It’s a challenge, but if we don’t work on it, this will be a serious societal problem,” says JI, who also directs Amazon-Aill-ortificial Intelligence for Interactive Conversation Experiences (AICE). “Help people keep in touch so they can make good choices: It’s my motivation.”

Related content

The center will support UIUC researchers in their development of new approaches to conversation AI systems.

To this end, JI Blender Lab leads at UIUC, where she seeks to promote a future with information accessibility, where computers will be able to discern precisely, concise and reliable knowledge from the information swirling through this tsunami. Not only that, she says, we will also have access to access this reliable knowledge by talking to computers using natural language.

“We want to know who did what, to whom, where and when units, events and actions, claims and counterclaims, their interconnections and then makes sense of it all,” says Ji.

The most important approach JI brings on this challenge is Natural-Language Processing (NLP) and her groundbreaking work in information extraction (ie).

Property situation

The roots of you can be traced back to The Message Unity Conference (MUC), a number of events that Defense Advanced Research Projects Agency started in the late 1980s. The program was co-led by Ralph Grishman, who would later become JI’s PhD advisor. Today, Ji IE brings back to her roots with a technology, such as her team dream in March, called Smartbook, with the support of the Defense Advanced Research Project Agency (Darpa) and the US National Science Foundation.

In times of disaster, such as a global pandemic or continuous conflicts such as the Russian invasion of Ukraine, good decision making requires to gather understanding of the intelligence of reality on earth. In conflicts, this Intel is reduced to a situation reports (Sitreps).

Analysts and humanitarian workers must gather and digest great love for up -to -date daily documents and then combine it with extensive local and cultural knowledge and the wider dynamics of a disaster. Only then can analysts create useful Sitreps that military leaders or politicians can use to make strategic decisions. It’s a tough process to automate.

By 2022, JI encountered the nonprofit organization data program that produces a situational analysis of the Ukraine crisis every two weeks.

“I wanted to help this group by automating a first draft of their Sitreps so that they control time on what they are really good at – using their expertise to shape this draft, add strategically important information and make recommendations.”

What JI and her partners, led by Clare Voss at the US Army Research Laboratory, Cam Up with was Smartbook Framework. The Ukraine crisis as a case study digests the SmartBook Store Amanzes of News Data from the Internet and automatically extracts information, including events, places, people, weapons and military actions, and all together to produce Sitreps.

The reports are structured in timelines with major events such as chapters, with relevant strategic issues used as section headings and similar summary across requirements based on links to the information sources (Fig. 1). Everything is automatic.

Fig. Follow the pink sections to see how an example of two weeks of timeline is chaptered as a series of key events, with each event branching out to section headlines that are related strategic questions. Each strategic question, in turn, is linked to being claims, each supported by invoice vouchers and associated nowledge elements (devices and events).

While Smartbook uses large language models (LLMs) to produce Summmaries (Fig. 1 above, bottom right) on extraced requirements from news sources, it’s only one of many components of the Smartbook frame. Chatgpt alone, for example, could not generate a structured Sitrep, not least because it is not trained on updated information. And LLMs tend to hallucinate, generate information or “answers” that are not group in the source news data, leading to output that can be inaccurate, misleading or completely manufactured.

When an expert analyst was asked to edit Sitreps produced by Smartbook, they added more details to the document, but removed only approx. 2% of the content. “This indicates that the Smartbook can serve as a good starting point for analysts to expand for generating situation reports,” says JI.

This early iteration of SmartBook Raies on news reports in English, but JIS Team is currently increasing various sources of information and languages ​​to produce a more rounded image.

Drug discovery

Another of Ji’s passions is to apply her skills to support the discovery of narcotics. JI consumes a future where a doctor can write a few sentences that describe a tailor -made drug to treat a specific patient and then receive the exact structure of a drug with the desired properties that could in turn be tested and synthesized into order. Currently, the development of a single new drug can take over a decade and costs up to a billion dollars.

Marinka Zitnik.jpg

Related content

ARA headline Marinka Zitnik is focused on how machine learning can enable Accord diagnoses and the development of new treatments and therapies.

JI and her team developed a new learning framework that together represent molecules and languages ​​and enable translations between the two. “I was trained as a computational linguist, so I tend to see Everhynness as a foreign language, and it included molecules, pictures or videos,” she says.

The framework is called the Molt5-A self-monitored-learning framework for prior models on the huge of unmarked, natural-linguistic text and molecular strings (a notation system representing molecular structure). Given a molecule -Thong, JI and her team report that Word Mott5 will give a text description that included the medical, atomic and chemical properties of the molecule. On the flip side, give Molt5 with a description of desired molecular properties and it generates the string to a molecule that best fits that description.

The idea is that Molt5 or its descendants will allow herds to utilize AI technologies to detect new drugs that use natural linguistic descriptions.

Human interactions

In March this year, JI helped strengthen the relationship between Amazon and UIUC by becoming founder of AICE. AICE aims to develop new conversation -AI systems that can automatically learn, reason, update their own nowledge and interact in multiple modalities.

“If your digital assistant could also read the books and watch the movies you have had, they will be able to conduct much more now, informative and interesting interviews with you,” says Ji. “It would make it interact with them more natural – more human.”

Another focus on AICE is to improve truth, justice and transparency in conversation AI systems.

Can the modern information tsunami really Temmes? “There is a change between creativity and truth,” says Ji, “but yes, I think we can design new algorithms to achieve both goals.”

Conversation-II Boom

After spending his career working in NLP, what would Ji tell students who consider it a research area, especially in light of LLM boom?

“Keep your optimism first! This LLM wave is exciting, even though it has hit a lot of studs hard, especially those already in the middle of the dissertation,” says Ji. “While LLMs seem to close some research options, the important new ones, such as structured prediction, cross-cutting document-reasoning, theoretical understanding of LLMS, Bill-Error correction and so many more.”

Belinda Zeng, head of Applied Science and Engineering at Amazon Search Science and AI, seen standing outside in Costa Rica on Sunny Day, a wire fence is just behind her in the foreground, and a valley and mountains are seen in the background

Career counseling

Belinda Zeng, head of Applied Science and Engineering at Amazon Search Science and AI, shares her views.

Ji also notes the Chinese saying “frequent movements make a tree die, but a person prosperous” and recommends mixing academic research and industry research. Ji has worked with the Alexa organization himself in its capacity as Amazon -Lærd since March. “I things Amazon because it gave the opportunity to take a problem in the real world,” she says. For example, JI works with LLM team at Amazon to, among other things, develop systems to minimize and prevent hallucinations.

“With Amazon, I want the ideas I contribute to become part of the next generation of AI systems and for many customers to feel the benefit of it. It is a very different way of measuring success compared to academia and it is refreshing.”

Leave a Comment