The Data Foundation for Natural Law Models

What if the binding constraint on the next wave of scientific AI is not compute, not talent, and not architecture but the data underneath it all? I suspect for many reading this, the instinctive response is that data is obviously important. But the gap between acknowledging its importance and actually investing in the infrastructure to make it usable is, as the past decade has taught us, rather wide.

A Genuinely New Class of AI

Over the past few years, something meaningfully different has been happening in scientific AI. Not the familiar story of language models getting larger, but something more specific and, I would argue, more consequential. AlphaFold predicted protein structures with atomic accuracy. Evo2 decoded the regulatory grammar of DNA across all domains of life. GNOME discovered 2.2 million new materials. These are not chatbots. They are not summarising papers or generating plausible-sounding text. They are learning the actual rules that govern natural phenomena. 

The Institute for Progress recently gave this class of systems a useful name: Natural Law Models (NLMs) – AI models trained on experimental data or physics-based simulations, explicitly designed to learn underlying natural phenomena. Their architectures often map onto distinct processes within the system they are studying. They are domain-specific by design. And they are producing results that general-purpose language models can’t.

When you trace the lineage of every successful NLM, you do not arrive at a clever architecture or a massive GPU cluster. You arrive at a dataset. AlphaFold was trained on the Protein Data Bank and GenBank, both products of decades of publicly funded data infrastructure. Evo2 was trained on OpenGenome2, a curated assembly of genetic sequences. GNOME and MatterGen used the Materials Project database. 

The IFP framework identifies three inputs for building NLMs: large, standardised, high-quality scientific datasets; experts with deep knowledge of both AI and the domain; and substantial compute. Of these three, data is the binding constraint in most fields– not necessarily the most expensive constraint, the binding one. The one that determines whether a model gets built at all. 

Leash Bio’s Hermes model illustrates the point well. Even a relatively simple transformer architecture can compete with state-of-the-art structure-based methods when trained on sufficient high-quality protein-ligand interaction data. Data quality and diversity matter more than model sophistication.

The FAIR Gap

This year marks 10 years since “The FAIR Guiding Principles for scientific data management and stewardship” were published. The State of Open Data report, has now analysed over 43,000 researcher responses across 10 years, reveals a persistent gap. While 85% of researchers believe open data is important for science, fewer than 30% describe their own data as truly FAIR-compliant. 

The pharmaceutical industry offers a particularly instructive case. Billions have been invested in the promise of AI-driven drug discovery. Yet what the Pistoia Alliance calls the “scientific content crisis” in pharma, critical research data remains trapped in PDFs, siloed in proprietary databases, disconnected from the experimental context that gives it meaning. It is the primary barrier to building NLMs for drug-target interaction prediction, toxicology modelling, and clinical outcome forecasting. One cannot learn the natural laws governing drug-protein interactions if half the relevant data is buried in supplementary materials that no machine can read.

Knowledge Graphs: The Connective Tissue

FAIR compliance is necessary but not sufficient. Scientific data also needs to be connected. A gene variant in ClinVar, a drug interaction in ChEMBL, a pathway annotation in Reactome, and a population frequency in gnomAD are individually useful. Linked together in a knowledge graph, they become the substrate for a class of reasoning that neither isolated databases nor language models can achieve alone. 

Knowledge graphs provide the semantic layer that transforms FAIR-compliant data into NLM training sets. They encode not just facts but relationships that a gene encodes a protein that interacts with a drug that treats a disease with a known population prevalence. This relational structure is precisely what NLMs need to learn the “natural laws” governing biological systems.

Platforms such as metaphacts’ metaphactory make this semantic layer operational. metaphactory is an enterprise knowledge graph platform that supports semantic knowledge modeling and knowledge discovery, enabling organizations to integrate heterogeneous datasets into a unified, ontology-driven graph that becomes the backbone for AI-native applications and exploration workflows. By providing a structured semantic model and an environment for discovery and collaboration, it turns disconnected life sciences data into a coherent, machine-interpretable foundation for reasoning systems. For organizations that need a high-quality scientific backbone out of the box, the Dimensions Knowledge Graph, powered by metaphactory, provides a ready-made research graph built on more than 32 billion structured statements derived from the Dimensions global research database and public data sources such as STRING and UMLS, enriched with domain ontologies. This gives teams an immediate, large-scale relational substrate that can be extended with proprietary assay data, clinical results, or real-world evidence, accelerating the transition from fragmented resources to a unified knowledge infrastructure.

The most effective AI systems for biology are not the ones with the most parameters. They are the ones with the best data infrastructure. As AI agents become central to biological research, how do we know these systems are actually working?

Phylo’s recent analysis of biology agent benchmarks reveals a challenge: failures in biology are silent. A flawed analysis can produce plausible-looking results that propagate through research pipelines for months before anyone catches the error. This is not like a chatbot hallucinating a historical date; this is a computational analysis producing a biologically plausible but incorrect result that informs real experimental decisions. Once that semantic backbone exists, AI systems must be grounded in it. 

Metis, metaphacts’ knowledge-driven AI platform, combines large language models with knowledge graphs to deliver semantically precise, contextualized insights. By explicitly grounding AI outputs in the underlying semantic layer and providing built-in quality control and explainability, it reduces hallucinations and makes it possible to trace how answers were generated. In biology, where failures are often silent and plausible-looking results can propagate through pipelines before errors are detected, this traceability is essential.

The response from the field has been the development of trace-based evaluation – scoring agents on their analytical process, not just their final answers. Phylo’s BiomniBench framework evaluates five facets – data handling, tool selection, statistical rigour, source reliability, and reasoning chain coherence. This mirrors how science itself works: peer review scrutinises methods and reasoning, not just conclusions. An agent’s analytical trace is only auditable if every data source is identified, every transformation is logged, and every inference is traceable to its evidential basis. This is, in effect, a restatement of the FAIR principles applied to AI reasoning itself. Provenance is not a nice-to-have. It is the foundation of trustworthy AI in science.

An Asymmetric Bet for Pharma

Organisations that invest in FAIR data infrastructure and knowledge graphs today are not merely improving their current research workflows– they are building the training data for tomorrow’s NLMs. Even if general-purpose LLMs eventually supersede domain-specific NLMs (the “bitter lesson” argument), the underlying datasets will still be essential as training data, fine-tuning corpora, or evaluation benchmarks.

The practical implications follow directly. Pharmaceutical companies should invest in connecting their internal experimental data to public knowledge graphs using standardised ontologies and persistent identifiers. Research funders should prioritise data infrastructure alongside experiment funding. Technology providers should build knowledge graph platforms that make FAIR compliance the default, not the exception.

AlphaFold succeeded because the Protein Data Bank existed. Decades of painstaking, publicly funded data curation created the substrate on which a model could learn the rules of protein folding. The question for every scientific field is “what is our Protein Data Bank, and are we building it?” The organisations that answer this question first will define the trajectory of AI-driven discovery for the next decade. 

The data infrastructure being built today is not about better workflows or compliance checklists. It is the foundation layer for a class of AI that is already producing genuine scientific breakthroughs.

The models will keep improving. But they can only be as good as the data they are trained on.

The post The Data Foundation for Natural Law Models appeared first on Digital Science.



from Digital Science https://ift.tt/7qTHDb8

πŸ’§ Advanced Nanoarchitectonics in Hexagonal Boron Nitride-Based Functional Materials for Water Treatment♻️


 Advanced nanoarchitectonics in hexagonal boron nitride (h-BN)-based functional materials is transforming the future of water treatment πŸ’§. Known for its exceptional thermal stability, chemical resistance, and layered structure, h-BN provides an ideal platform for designing high-performance nanomaterials. By precisely engineering its surface chemistry and morphology, researchers are unlocking powerful adsorption and catalytic capabilities πŸ”¬.

Through nanoarchitectonic strategies such as surface functionalization, defect engineering, and hybrid composite formation, h-BN materials can effectively remove heavy metals, organic dyes, pharmaceuticals, and emerging contaminants from wastewater 🌊. These tailored nanostructures offer high surface area, improved selectivity, and enhanced reusability, making them highly efficient and cost-effective for sustainable purification systems ♻️.

Moreover, integrating h-BN with photocatalysts, membranes, and magnetic nanomaterials further enhances separation efficiency and regeneration performance 🚰. Such advanced designs support scalable, eco-friendly water treatment technologies that address global clean water challenges 🌍. Nanoengineered h-BN is paving the way toward smarter, greener environmental solutions ✨.

Global Scholar Awards 🌟

Visit Our Website 🌐: globalscholarawards.com Nominate NowπŸ‘: https://globalscholarawards.com/doctor-awards-nobel-prize-scientists-award-nomination/?ecategory=Awards&rcategory=Awardee Contact us ✉️: contact@globalscholarawards.com

Get Connected Here:
=================
Twitter : x.com/ScienceInventi1 Youtube        :  youtube.com/@nesinconferenceandawards4869 
Pinterest : in.pinterest.com/scienceinventions/ Instagram : instagram.com/kaylee_rowan_ Linkedin : linkedin.com/in/new-science-inventions-09664427b Blog : newscienceinventions2020.blog...

#WorldResearchAwards #ResearchAwards #AcademicAwards #ScienceAwards #GlobalResearchAwards #shorts #Scifax #researchers #labtechnicians #conference #awards #professors #teachers #lecturers #biologybiologiest #physicist #ai #rain #network #coordinator #business #motor #medicine #mechanic #finance #bestreseracher #bestpaper #monry #awards #vision #research #eyes #researchawards #Environment #GoGreen #EarthLovers #shorts

REF readiness: evidencing Engagement & Impact

Retrospective storytelling alone cannot be used to evidence research impact. And that is one core point underlined in the 2029 Research Excellence Framework (REF); in fact it highlights that impact is assessed on whether reach and significance can be clearly evidenced and traced back to the research question itself, and whether institutions can demonstrate a clear, credible connection between research activity and real-world change.

Moreover, analysis of REF 2021 impact case studies reinforces an important reality: impact is rarely a linear, end-of-project activity. Instead, it is built through engagement, partnerships and co-production across the research lifecycle. REF 2029 now makes this explicit, recognising these non-linear and engagement-led pathways to impact.

“For universities, quite often the challenge is not generating the impact itself but having access to a joined-up evidence-based view of where and how impact is already beginning to emerge… early enough to be recognised, sustained and built upon,” says Ann Campbell, Director Research Impact & Comparative Analytics at Digital Science, drawing on her experience supporting REF submissions in a previous university role as Research Systems and Data Manager.

Impact often starts with research embedded beyond academia

“Analysing the REF 2021 Impact Case Study data helps clarify where impact really begins,” says Campbell. Of the 6,361 impact case studies submitted, 6,045 included underpinning research publications that could be identified and matched in Dimensions using DOIs, ISBNs and bibliographic metadata. “This allowed us to explore patterns in authorship, collaboration, and external engagement,” she explains.

Within these 6,045 case studies:

  • 1,193 were underpinned by research co-authored with healthcare partners
  • 1,040 involved government partners
  • 913 involved industry
  • 866 involved non-profit organisations

“This shows that a substantial proportion of REF impact is built on research already embedded in external systems, not simply disseminated afterwards,” she explains.  Across disciplines, different pathways are visible: health impact is dominated by healthcare-embedded research; technological impact often emerges from industry-embedded R&D; and environmental impact is closely linked to government and NGO networks.

Interconnected data in Dimensions helps universities see where their research is already connected to the world beyond academia, whether through hospitals, government bodies, industry or public organisations. This makes it possible to identify areas of research that are naturally well positioned to generate impact, early in the cycle.

“It is, however, important to emphasise that this kind of analysis does not select impact case studies or predict REF outcomes. Rather, it supports better understanding of where strong impact is already emerging, so institutions can recognise it sooner and support it more effectively over time,” says Campbell. “Crucially, this kind of analysis also helps institutions identify case studies that already have the building blocks of a REF-ready narrative – where links between underpinning research, external uptake, and corroborating evidence are clearer, traceable and easier to articulate.”

Engagement needs to be captured early, not reconstructed later

Even when research is externally embedded, evidence of engagement is often fragmented or lost in advisory roles, policy input or informal collaborations.  Over time, staff movement, changing roles and organisational turnover can erode institutional memory, making it harder to reconstruct how engagement unfolded and how impact developed.This creates risk when institutions later need to reconstruct timelines and narratives under time pressure.

REF 2029 explicitly requires units to describe how engagement and partnerships enabled impact, not just the outcomes. This means engagement activity should be recorded as it happens, not retrospectively inferred.

That is why it becomes important to have systems like Symplectic Elements that provide structured capture of engagement activity and linking it to related data such as people, publications and grants. In practice, it acts as an institutional memory layer, supporting traceability and consistency without assessing impact itself.

Impact relies on materials, not just publications

Impact pathways differ sharply by discipline. Health impact is evidenced through clinical guidelines and policy; technological impact through patents and translation; environmental impact through policy reports and media; and social and cultural impact through public and professional discourse.

Many of these pathways rely on outputs beyond the traditional journal article including datasets, reports, briefings, tools and other materials that support engagement and use. In many cases, these sit alongside conventional publications as part of a broader impact narrative. Yet these materials are often poorly preserved, uncitable or disconnected from the research record.

Research repository services such as Figshare support institutions by providing persistent access to impact-supporting materials, assigning DOIs and stable landing pages, and enabling transparency and reuse. Institutions may use Figshare, for example, either as a dedicated repository for non-traditional research outputs, or as a full institutional repository capturing both traditional publications, datasets,  and wider research outputs, depending on their infrastructure and needs.

Making external uptake visible

REF 2021 impact narratives and their underpinning research leave clear, traceable signals of real-world use, from policy and clinical guidelines to patents, Wikipedia and news. The mix of these signals varies by discipline, revealing different pathways from research to real-world change.

Tools like Altmetric reveal where research is being cited in policy and guidelines, taken up in patents, and discussed in media and public discourse. It does not just measure impact, but provides external confirmation of engagement and uptake, aligned with how REF panels assess reach and significance.

From reactive reporting to impact readiness

REF impact is built through sustained interaction with the world beyond academia. Institutions that invest early in understanding, capturing and evidencing engagement are better placed to meet REF expectations with confidence. Digital Science supports this readiness by strengthening evidence and decision-making while leaving judgement firmly with institutions and REF panels.

REF readiness is about leading, not lagging. It means planning ahead and building on early engagement signals, not relying on retrospective evidence gathering at the end of the research cycle.

Institutions that prepare most effectively:

  • understand where their research connects with the world, with interconnected data solutions, e.g., Dimensions
  • capture and structure engagement activity as it happens, linking people, publications and grants through systems such as Symplectic Elements 
  • preserve the materials that support impact claims by investing in a repository that mints research data and NTRO’s, like Figshare
  • validate reach and uptake beyond the institution using tools such as Altmetric

Together, these become the building blocks of REF-ready impact narratives: clearer to articulate, easier to evidence, and more credible to defend.

The post REF readiness: evidencing Engagement & Impact appeared first on Digital Science.



from Digital Science https://ift.tt/Z7OBAKz

🧲 Developing Sustainable Bioreactors with Magnetically Actuated Smart Materials🌱


 Developing sustainable bioreactors using magnetically actuated smart materials is opening new horizons in biotechnology and green engineering. 🌱🧲 These innovative systems use magnetic fields to control internal mixing, cell stimulation, and fluid dynamics without traditional mechanical agitators. This contactless operation reduces contamination risks and improves overall process efficiency. πŸ§ͺ

Magnetically responsive polymers and nanoparticles enable precise, real-time control of biological environments. ⚙️ By adjusting magnetic signals, researchers can enhance nutrient distribution, oxygen transfer, and metabolic activity inside the reactor. This smart control leads to higher yields in applications such as biofuel production, tissue engineering, and pharmaceutical manufacturing. πŸ’Š

Sustainability is a central advantage of these advanced bioreactors. 🌍♻️ Reduced energy consumption, lower mechanical wear, and optimized resource use make them environmentally friendly and cost-effective. As innovation continues, magnetically actuated smart materials are set to revolutionize scalable, eco-conscious bioprocessing technologies worldwide. πŸš€

Global Scholar Awards 🌟

Visit Our Website 🌐: globalscholarawards.com Nominate NowπŸ‘: https://globalscholarawards.com/doctor-awards-nobel-prize-scientists-award-nomination/?ecategory=Awards&rcategory=Awardee Contact us ✉️: contact@globalscholarawards.com

Get Connected Here:
=================
Twitter : x.com/ScienceInventi1 Youtube        :  youtube.com/@nesinconferenceandawards4869 
Pinterest : in.pinterest.com/scienceinventions/ Instagram : instagram.com/kaylee_rowan_ Linkedin : linkedin.com/in/new-science-inventions-09664427b Blog : newscienceinventions2020.blog...

#WorldResearchAwards #ResearchAwards #AcademicAwards #ScienceAwards #GlobalResearchAwards #shorts #Scifax #researchers #labtechnicians #conference #awards #professors #teachers #lecturers #biologybiologiest #physicist #ai #rain #network #coordinator #business #motor #medicine #mechanic #finance #bestreseracher #bestpaper #monry #awards #vision #research #eyes #researchawards #Environment #GoGreen #EarthLovers #shorts

🌱Sustainable Transformation of Waste Cooking Oil: Global Valorization Pathways & Future Directions♻️


 The sustainable transformation of waste cooking oil (WCO) is gaining global attention as industries and researchers seek eco-friendly alternatives to fossil-based resources 🌱♻️. Instead of being discarded and causing water and soil pollution, WCO can be converted into valuable products like biodiesel, bio-lubricants, soaps, and bioplastics. This approach supports the circular economy πŸ”„ by turning everyday kitchen waste into renewable energy and green materials 🌍.

Across the world, innovative valorization pathways are emerging ⚙️πŸ”₯. Technologies such as transesterification, pyrolysis, and hydroprocessing enable efficient conversion of waste oil into clean fuels and high-value chemicals. Countries are also implementing collection systems and supportive policies to streamline supply chains and reduce environmental impact. These advancements not only lower greenhouse gas emissions 🌎 but also create economic opportunities and green jobs πŸ’Ό.

Looking ahead, future directions focus on improving process efficiency, reducing production costs, and integrating advanced catalysts and AI-based optimization πŸ€–πŸ§ͺ. Collaboration between academia, industry, and policymakers is crucial for scaling up sustainable solutions. By embracing innovation and responsible consumption, the global community can transform waste cooking oil into a powerful driver of sustainable development and energy security ⚡🌿.

Global Scholar Awards 🌟

Visit Our Website 🌐: globalscholarawards.com Nominate NowπŸ‘: https://globalscholarawards.com/doctor-awards-nobel-prize-scientists-award-nomination/?ecategory=Awards&rcategory=Awardee Contact us ✉️: contact@globalscholarawards.com

Get Connected Here:
=================
Twitter : x.com/ScienceInventi1 Youtube        :  youtube.com/@nesinconferenceandawards4869 
Pinterest : in.pinterest.com/scienceinventions/ Instagram : instagram.com/kaylee_rowan_ Linkedin : linkedin.com/in/new-science-inventions-09664427b Blog : newscienceinventions2020.blog...

#WorldResearchAwards #ResearchAwards #AcademicAwards #ScienceAwards #GlobalResearchAwards #shorts #Scifax #researchers #labtechnicians #conference #awards #professors #teachers #lecturers #biologybiologiest #physicist #ai #rain #network #coordinator #business #motor #medicine #mechanic #finance #bestreseracher #bestpaper #monry #awards #vision #research #eyes #researchawards #Environment #GoGreen #EarthLovers #shorts

πŸ›️Status and Subfield: Mapping Sociological Specializations Across Departments πŸ“Š


The discipline of sociology has evolved into a diverse landscape of specializations, ranging from gender studies and urban sociology to digital sociology and environmental research. Across universities, the distribution of these subfields often reflects institutional priorities, funding patterns, and societal needs. Some departments emphasize quantitative research and policy analysis, while others prioritize critical theory or community-based engagement. πŸ“š✨

The status of a subfield within a department can shape hiring decisions, curriculum design, and research visibility. Well-established areas such as medical sociology or criminology may receive stronger institutional support, while emerging areas like digital inequality or climate migration are gradually gaining recognition. These patterns influence collaboration networks, publication output, and student enrollment trends. πŸ”πŸ“ˆ

Understanding how sociological specializations are distributed helps reveal broader academic power structures and intellectual trends. It also encourages departments to evaluate balance, inclusivity, and innovation within their programs. By recognizing both dominant and emerging subfields, sociology can remain responsive to global challenges and evolving social realities. 🌍🀝

Global Scholar Awards 🌟

Visit Our Website 🌐: globalscholarawards.com Nominate NowπŸ‘: https://globalscholarawards.com/doctor-awards-nobel-prize-scientists-award-nomination/?ecategory=Awards&rcategory=Awardee Contact us ✉️: contact@globalscholarawards.com

Get Connected Here:
=================
Twitter : x.com/ScienceInventi1 Youtube        :  youtube.com/@nesinconferenceandawards4869 
Pinterest : in.pinterest.com/scienceinventions/ Instagram : instagram.com/kaylee_rowan_ Linkedin : linkedin.com/in/new-science-inventions-09664427b Blog : newscienceinventions2020.blog...

#WorldResearchAwards #ResearchAwards #AcademicAwards #ScienceAwards #GlobalResearchAwards #shorts #Scifax #researchers #labtechnicians #conference #awards #professors #teachers #lecturers #biologybiologiest #physicist #ai #rain #network #coordinator #business #motor #medicine #mechanic #finance #bestreseracher #bestpaper #monry #awards #vision #research #eyes #researchawards #Environment #GoGreen #EarthLovers #shorts


 

πŸ” The Intersection of Human Rights and Mandatory Arbitration: An Overview


In today’s evolving legal landscape, the relationship between human rights and mandatory arbitration ⚖️ has become a topic of growing importance. Mandatory arbitration requires parties to resolve disputes outside traditional courts, often through private tribunals. While it offers efficiency, reduced costs, and quicker resolutions ⏳, concerns arise about whether it adequately protects fundamental human rights such as access to justice, equality, and due process 🌍.

One key issue is the potential imbalance of power between corporations and individuals 🀝. Employees, consumers, or marginalized groups may be required to sign arbitration agreements without fully understanding their implications. This can limit transparency and reduce opportunities for public scrutiny πŸ”. Critics argue that private proceedings may restrict appeals and set fewer procedural safeguards compared to open court systems πŸ›️.

However, supporters believe arbitration can provide flexible and specialized dispute resolution mechanisms πŸ“š. When designed with fairness and accountability in mind, it can complement judicial systems rather than undermine them. The challenge lies in striking a balance ⚖️—ensuring efficiency while upholding the core principles of human rights, dignity, and equal protection under the law ✨.

Global Scholar Awards 🌟

Visit Our Website 🌐: globalscholarawards.com Nominate NowπŸ‘: https://globalscholarawards.com/doctor-awards-nobel-prize-scientists-award-nomination/?ecategory=Awards&rcategory=Awardee Contact us ✉️: contact@globalscholarawards.com

Get Connected Here:
=================
Twitter : x.com/ScienceInventi1 Youtube        :  youtube.com/@nesinconferenceandawards4869 
Pinterest : in.pinterest.com/scienceinventions/ Instagram : instagram.com/kaylee_rowan_ Linkedin : linkedin.com/in/new-science-inventions-09664427b Blog : newscienceinventions2020.blog...

#WorldResearchAwards #ResearchAwards #AcademicAwards #ScienceAwards #GlobalResearchAwards #shorts #Scifax #researchers #labtechnicians #conference #awards #professors #teachers #lecturers #biologybiologiest #physicist #ai #rain #network #coordinator #business #motor #medicine #mechanic #finance #bestreseracher #bestpaper #monry #awards #vision #research #eyes #researchawards #Environment #GoGreen #EarthLovers #shorts

 

Featured Post

The Data Foundation for Natural Law Models

What if the binding constraint on the next wave of scientific AI is not compute, not talent, and not architecture but the data underneath it...

Popular