By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: This AI Reads Science Papers Like a Pro, Even When Humans Can’t Agree on the Words | HackerNoon
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Computing > This AI Reads Science Papers Like a Pro, Even When Humans Can’t Agree on the Words | HackerNoon
Computing

This AI Reads Science Papers Like a Pro, Even When Humans Can’t Agree on the Words | HackerNoon

News Room
Last updated: 2025/04/18 at 6:02 PM
News Room Published 18 April 2025
Share
SHARE

Authors:

(1) Yanpeng Ye, School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia, GreenDynamics Pty. Ltd, Kensington, NSW, Australia, and these authors contributed equally to this work;

(2) Jie Ren, GreenDynamics Pty. Ltd, Kensington, NSW, Australia, Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong, China, and these authors contributed equally to this work;

(3) Shaozhou Wang, GreenDynamics Pty. Ltd, Kensington, NSW, Australia ([email protected]);

(4) Yuwei Wan, GreenDynamics Pty. Ltd, Kensington, NSW, Australia and Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China;

(5) Imran Razzak, School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia;

(6) Tong Xie, GreenDynamics Pty. Ltd, Kensington, NSW, Australia and School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW, Australia ([email protected]);

(7) Wenjie Zhang, School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia ([email protected]).

Editor’s note: This article is part of a broader study. You’re reading Part 3 of 9. Read the rest below.

Table of Links

Data preparation and schema design

Material experts undertook the annotation of nine distinct categories from 75 abstracts of research papers, act as LLM’s training dataset. These categories encompassed core labels – “Name,” “Formula,” or “Acronym” – which consistently represent specific materials across diverse papers, albeit in varied forms. Additionally, supplementary labels included “Descriptor,” “Structure/Phase,” “Application,” “Property,” “Synthesis,” and “Characterization.” The “Property” label served to delineate quantitative properties such as ’length,’ ’specific surface area,’ and ’mass,’ whereas the “Descriptor” label encapsulated qualitative attributes like ’stable,’ ’vertically,’ and ’safe.’ Since from the application aspects, developers always concern about its own properties, we also annotated the essential parameters as “Property” directly linked to specific “Application”. For instance, attributes like ’specific capacity’ for lithium-ion batteries, ’energy conversion efficiency’ for solar cells, and ’H2 production rate’ for hydrogen evolution reaction were systematically annotated.

As the Figure 2 shows, the central node, “Material,” is pivotal and connects to nodes that describe its nomenclature, composition, and various attributes. Specifically, “Material” is consisted of “Formula”, “Name” and “Acronym”. “Structure/Phase” describing its physical form; “Application” denoting its practical uses; and “Property” outlining its inherent characteristics. The “Material” node is also associated with “Descriptor” node that provide additional qualitative information. Moreover, the “Application” node branches out to “Property”, “Descriptor” and a “Domain” suggesting further specification and contextual relevance of application. To save the source of nodes and relations, each node is connected to a “Digital Object Identifier (DOI)” node. If we want to determine the source article from which the relation is derived, we can obtain the result by querying the intersection of “DOI” node neighbors at both ends of the relation.

Figure 2. This schematic represents the FMKG schema (a) All the nodes in FMKG connect to all their sources - "DOI". (b) Relations between all labels. (c) Relations between core labels, the more arrows pointing to the label, the lower the priority it has.Figure 2. This schematic represents the FMKG schema (a) All the nodes in FMKG connect to all their sources - "DOI". (b) Relations between all labels. (c) Relations between core labels, the more arrows pointing to the label, the lower the priority it has.

Given the inherent complexity of sentences and the variability in terminologies across abstracts, a normalization process was employed after the initial extraction. This normalization ensured uniform representation of entities with similar meanings. For example, terms such as ’Lithium-Ion Battery’ and ’Li-ion batteries’ were standardized to ’lithium-ion battery,’ while phrases like ’solution casting method,’ ’solvent post-treatment method,’ and ’solution-based deposition’ were simplified to ’solution-processed.’ All annotated entities underwent this normalization process, ensuring consistency and facilitating effective training of LLMs. The field of functional materials encompasses a wide range of areas. At this stage, our priority is to focus on energy materials. We download 150,000 abstracts of peer-reviewed research articles of energy material science includes battery, solar cell and catalyst from the Web of Science. Each abstract was stored in a JSON file format, structured as “DOI – text,” facilitating seamless processing and analysis.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Keri Hilson’s ‘We Need to Talk’ marks her return to music after a 15-year absence
Next Article Major leak reveals how this year’s new Moto Razr Plus is set to arrive
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

NOI, Not Noise: Why Arjun Kannan’s Startup is Yelling at Chatbots on Behalf of Property Managers
Gadget
Free Character Profile Template for Writers & Creators |
Computing
This survival saga from Argentina is blowing up on Netflix – why it’s a must-watch
News
Whoop angers users over reneged free upgrade promises
News

You Might also Like

Computing

Free Character Profile Template for Writers & Creators |

20 Min Read
Computing

The State of Cloud Storage: #Decentralize-Cloud | HackerNoon

10 Min Read
Computing

How to Implement Account-based Marketing: 7 Real-Life Examples

32 Min Read
Computing

What Happens When Blockchain Miners Cheat the System | HackerNoon

8 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?