Perspective ∙ Volume 112, Issue 5, p698-717 ∙ March 06, 2024 ∙ Open access

Data science opportunities of large language models for neuroscience and biomedicine

Danilo Bzdok (1,3; corresponding author: danilobzdok@gmail.com), Andrew Thieme (2), Oleksiy Levkovskyy (2), Paul Wren (2), Thomas Ray (2), Siva Reddy (1,4,5)

Affiliations:
1. Mila - Quebec Artificial Intelligence Institute, Montreal, QC, Canada
2. Mindstate Design Labs, San Francisco, CA, USA
3. The Neuro - Montreal Neurological Institute (MNI), Department of Biomedical Engineering, McGill University, Montreal, QC, Canada
4. Facebook CIFAR AI Chair
5. ServiceNow Research

Abstract

Large language models (LLMs) are a new asset class in the machine-learning landscape. Here we offer a primer on defining properties of these modeling techniques. We then reflect on new modes of investigation in which LLMs can be used to reframe classic neuroscience questions to deliver fresh answers. We reason that LLMs have the potential to (1) enrich neuroscience datasets by adding valuable meta-information, such as advanced text sentiment, (2) summarize vast information sources to overcome divides between siloed neuroscience communities, (3) enable previously unthinkable fusion of disparate information sources relevant to the brain, (4) help deconvolve which cognitive concepts most usefully grasp phenomena in the brain, and much more.

Introduction

Language has more human information per bit than potentially any other form of data. Natural language processing (NLP) to analyze human text has come a long way. In the early days, simple language models like n-gram models (e.g., a 2-gram treats word-word combinations as unique entities) were used to study language and semantics with various goals. Language models have at times also been used to study various cognitive tasks like reading comprehension, language translation, and question answering. Researchers compared the performance of NLP models on these tasks with human performance to gain insights into human cognition, such as in the field of psycholinguistics.
The rise of deep learning after ∼2010 ignited the era of semantic “embeddings” in NLP modeling: single words, sentences, paragraphs, or entire documents could be encapsulated in a compact float-vector format that denotes meaning. Intuitively, such embeddings can be thought of as locations in a high-dimensional coordinate system that enable mapping of semantic entities (word sequences) relative to their contextual similarity.[1,2,3,4] The more two semantic entities denote similar contexts, the more similar their semantic embeddings will be. Using last-generation models like Word2Vec[5] and GloVe,[6] researchers started to use these interoperable semantic embedding representations to quantify the relationships in meaning, such as between words or sentences.

1. Mikolov, T., Sutskever, I., Chen, K., et al. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 2013; 26.
2. Le, Q., Mikolov, T. Distributed representations of sentences and documents. PMLR. 2014; 32:1188-1196.
3. Conneau, A., Kiela, D., Schwenk, H., et al. Supervised learning of universal sentence representations from natural language inference data. Preprint at arXiv. 2017.
4. McCann, B., Bradbury, J., Xiong, C., et al. Learned in translation: Contextualized word vectors. Adv. Neural Inf. Process. Syst. 2017.
5. Mikolov, T., Chen, K., Corrado, G., et al. Efficient estimation of word representations in vector space. Preprint at arXiv. 2013.
6. Pennington, J., Socher, R., Manning, C.D. Glove: Global vectors for word representation.
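To make the notion of embeddings concrete, the following minimal sketch compares pre-trained word vectors by cosine similarity. It assumes the gensim library and its downloadable 50-dimensional GloVe vectors ("glove-wiki-gigaword-50"); the example words are arbitrary.

# Minimal sketch: word embeddings as points in a semantic coordinate system.
import numpy as np
import gensim.downloader as api

vectors = api.load("glove-wiki-gigaword-50")  # KeyedVectors: word -> 50-d float vector

def cosine(u, v):
    # Cosine similarity: higher values mean more similar usage contexts.
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Words that occur in similar contexts receive similar embeddings.
print(cosine(vectors["neuron"], vectors["synapse"]))   # relatively high
print(cosine(vectors["neuron"], vectors["banana"]))    # relatively low
# Nearest neighbors in embedding space recover semantic relatedness.
print(vectors.most_similar("cortex", topn=5))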
Current LLMs are trained on more text than one human could read in hundreds or thousands of lifetimes. This allows them to perform feats like writing computer programming code, mathematics, planning, literature reviews and summations, or playing text-based games—many forms of emergent capabilities that even their developers did not anticipate.[7] Sometimes such models are used to study how the brain itself processes contextual information and how the human mind generates language (please see Goldstein et al.,[8] Caucheteux et al.,[9] and Schrimpf et al.[10] for excellent examples), although this will not concern us here. As the current paradigm shifts and trends exponentially, LLMs learn what are probably the most powerful internal representations of meaning to date.

7. Bubeck, S., Chandrasekaran, V., Eldan, R., et al. Sparks of artificial general intelligence: Early experiments with gpt-4. Preprint at arXiv. 2023.
8. Goldstein, A., Zada, Z., Buchnik, E., et al. Shared computational principles for language processing in humans and deep language models. Nat. Neurosci. 2022; 25:369-380.
9. Caucheteux, C., Gramfort, A., King, J.-R. Evidence of a predictive coding hierarchy in the human brain listening to speech. Nat. Hum. Behav. 2023; 7:430-441.
10. Schrimpf, M., Blank, I.A., Tuckute, G., et al. The neural architecture of language: Integrative modeling converges on predictive processing. Proc. Natl. Acad. Sci. USA. 2021; 118, e2105646118.
Human language mirrors human thought, which is why state-of-the-art NLP is likely to provide us with inherent advantages. In this perspective, we attempt to discuss impending implications for investigators in neuroscience and biomedicine.

Data science perspective on large language model solutions

Historically, convolutional deep neural networks have revived AI excitement since 2010–2012, and LLMs are currently fueling yet another wave of momentum in the AI ecosystems. Language modeling has recently made substantial leaps forward following the introduction of the transformer architecture (for example, Vaswani et al.[11] was cited >90,000 times in the first 5 years after publication), driving the current thrust in AI innovation. For example, GPT-2 consists of 24 transformer blocks and recent architectures are even deeper (some of which have not been disclosed). As an instance of what has recently been called “generative AI” (gen AI), the outcome from these algorithms is not a class (e.g., patients with disease versus healthy group), number (e.g., cognitive performance measure), or discrete category (e.g., brackets of yearly income), but a structured “content” like language (as well as images or audio formatted information), i.e., synthesizing or fantasizing new content from previously ingested content.

11. Vaswani, A., Shazeer, N., Parmar, N., et al. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017; 30.
Doing away with much of the complexity in the previous deep-neural-network generation, transformers have become state-of-the-art in NLP (Figures 1 and 2). This simplified architecture is much more scalable than its predecessors, partly because this modeling setup also lends itself well to parallelization of computation workflows. In contrast to previous deep NLP solutions, in transformer architectures, the interdependencies between word tokens, close or far, are equally well captured. Also departing from certain previous neural network design, transformer models are feed-forward deep learning architectures that do not include explicit loops of processing. Instead, there are implicit loops created by contextualization of the already generated, previous text fed back into the LLM as input (“autoregression”). Unlike the previous LLM generations revolving around BERT (bidirectional encoder representations from transformers; using a word-masking logic as the modeling objective), generative pre-trained transformer (GPT) architectures, such as ChatGPT, allocate attention only to the left-hand reading context or previous word tokens during training, which results in its unidirectional mode of processing, specifically autoregressive in nature. So-called position encoding is a feature of these architectures that quantifies a notion of where a word appears in a sentence (not just an unsorted “bag of words”). In other words, GPT variants predict the next word token in a sequence based on the preceding word tokens. Because of its unidirectional nature, GPT LLMs do not “see” or “consider” subsequent tokens when predicting the next word—it is looking to the past, not the future, of a given sentence (with analogy to how a human reads a book).
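The next-word logic described above can be made concrete with a short sketch of greedy autoregressive generation. It assumes the Hugging Face transformers library with the publicly released GPT-2 checkpoint; the prompt is an arbitrary example.

# Sketch of left-to-right, token-by-token generation (greedy decoding).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

ids = tokenizer("The hippocampus supports", return_tensors="pt").input_ids
for _ in range(20):                      # generate 20 tokens, one at a time
    with torch.no_grad():
        logits = model(ids).logits       # scores over the vocabulary at each position
    next_id = logits[0, -1].argmax()     # only the last position matters: the next word
    ids = torch.cat([ids, next_id.view(1, 1)], dim=1)  # feed the output back in ("autoregression")
print(tokenizer.decode(ids[0]))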
Figure 1 The heart of the self-attention mechanism in transformer architectures like LLMs
Figure 2 The role of the self-attention layer in the transformer neural network architecture
It is the self-attention mechanism that is at the heart of transformer-augmented and GPT-like modeling architectures. This feature allows the model to assign varying degrees of importance to different segments of the input text sequence based on different “attention scores.” Dedicating attention to closer versus further-away word tokens turns out to be algorithmically identical—no iterative, step-by-step process is needed to involve information segments further away, as is required in earlier generations of deep learning architectures. In transformers, the mechanism of focusing on nearby or distant words in a sentence is handled in the same way, allowing the model to consider all parts of a sentence or text sequence simultaneously. In contrast to earlier neural networks, this does not require a sequential, “time-step” approach to process and integrate distant ends of the input. The computational complexity of the common implementation of the self-attention mechanism scales quadratically with sequence length.[12] Despite various improvements in attention mechanisms, most of them still struggle in certain use cases involving especially long sequences.[13] Each transformer layer can “see” all tokens in its scope at once. Nevertheless, the depth of recursive information processing is limited by the number of consecutive transformer layers, such as in levels of nested meaning of sentences or nested multiplication of number sequences.

12. Hassid, M., Peng, H., Rotem, D., et al. How much does attention actually attend? Questioning the Importance of Attention in Pretrained Transformers. Preprint at arXiv. 2022.
13. Tay, Y., Dehghani, M., Abnar, S., et al. Long range arena: A benchmark for efficient transformers. Preprint at arXiv. 2020.
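As a minimal illustration of the mechanism discussed above, the following NumPy sketch implements (causal) scaled dot-product self-attention on a toy input. The weight matrices are random stand-ins for learned parameters, and the quadratic cost in sequence length is visible in the token-by-token score matrix.

# Minimal NumPy sketch of (causal) scaled dot-product self-attention.
import numpy as np

def self_attention(X, W_q, W_k, W_v, causal=True):
    Q, K, V = X @ W_q, X @ W_k, X @ W_v
    scores = Q @ K.T / np.sqrt(K.shape[-1])         # (n_tokens, n_tokens): quadratic in length
    if causal:
        # GPT-style models attend only to the current and earlier tokens.
        mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
        scores = np.where(mask, -np.inf, scores)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # attention scores sum to 1 per token
    return weights @ V                              # each output mixes all visible tokens

rng = np.random.default_rng(0)
n_tokens, d = 6, 8
X = rng.normal(size=(n_tokens, d))                  # one embedding vector per token
out = self_attention(X, *(rng.normal(size=(d, d)) for _ in range(3)))
print(out.shape)  # (6, 8): one contextualized vector per input token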
Moreover, current LLM architectures typically have several parallel attention mechanisms lined up in each of the consecutive transformer layers. This “multi-headed attention” (1) enables placement of a simultaneous parallel focus on several different aspects of the input sequence, expanding the breadth of complexities that can be captured overall, and (2) thus allows for several dimensions of semantic representation to be identified and extracted all at once (with some analogy to modeling different latent factor components like principal component analysis or autoencoder neural networks).[14]

14. Bzdok, D., Yeo, B.T.T. Inference in the age of big data: Future perspectives on neuroscience. Neuroimage. 2017; 155:549-564.

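A compact sketch of the multi-head idea, under the same toy assumptions as above (random weights standing in for learned parameters, NumPy only): the model dimension is split across several heads that attend in parallel, and their outputs are concatenated.

# Sketch of "multi-headed" attention via head splitting and re-joining.
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def multi_head_attention(X, n_heads):
    n_tokens, d_model = X.shape
    d_head = d_model // n_heads
    rng = np.random.default_rng(0)
    outputs = []
    for _ in range(n_heads):
        W_q, W_k, W_v = (rng.normal(size=(d_model, d_head)) for _ in range(3))
        Q, K, V = X @ W_q, X @ W_k, X @ W_v
        A = softmax(Q @ K.T / np.sqrt(d_head))  # this head's own attention pattern
        outputs.append(A @ V)                   # (n_tokens, d_head)
    # Concatenating the heads restores the model dimension; a final learned
    # linear map (omitted here) would normally mix the heads.
    return np.concatenate(outputs, axis=-1)     # (n_tokens, d_model)

X = np.random.default_rng(1).normal(size=(6, 8))
print(multi_head_attention(X, n_heads=2).shape)  # (6, 8)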
A notable and practically relevant choice is the temperature parameter (a positive scalar value). This hyperparameter controls the degree of “creativity” in the model outputs, as a form of calibrating exploration versus exploitation. Setting a high temperature (e.g., >1) produces a softer probability distribution over candidate next words in the last model layer. This leads to intentionally more fuzzy, and thus potentially less accurate but also more creative, outputs. In contrast, a low temperature (e.g., <1) leads to a sharper probability distribution over output word relevances. In this mode of operation, the model becomes more deterministic, sticking closely to the most probable candidates in the output distribution, thereby reducing stochasticity in its responses.
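A small sketch of how the temperature parameter reshapes the output distribution; the logits are invented values standing in for the scores a model assigns to candidate next tokens.

# Temperature-scaled softmax over hypothetical next-token scores.
import numpy as np

def next_token_distribution(logits, temperature):
    scaled = np.asarray(logits, dtype=float) / temperature
    probs = np.exp(scaled - scaled.max())
    return probs / probs.sum()

logits = [4.0, 3.0, 1.0, 0.5]                    # hypothetical scores for four candidate tokens
print(next_token_distribution(logits, 0.5))      # low temperature: sharp, near-deterministic
print(next_token_distribution(logits, 1.0))      # plain softmax
print(next_token_distribution(logits, 2.0))      # high temperature: flatter, more "creative"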
Despite simple modeling objectives (e.g., BERT invoking word masking and GPT-3 invoking next-word prediction, while involving human feedback in the case of GPT-4/ChatGPT), by their sheer enormity, transformer-endowed architectures have sparked few-shot learning (i.e., learning based on few examples from a target task) and performance paradigms, inherently deriving aspects of a semantic world model in multiple settings.[15] These capabilities are at the core of the self-supervised modeling regime (cf. next sections). These secondary consequences led even the creators of these models to struggle for explanations behind the successes of LLMs.[16]

15. Wei, J., Tay, Y., Bommasani, R., et al. Emergent abilities of large language models. Preprint at arXiv. 2022.
16. OpenAI. GPT-4 Technical Report. Preprint at arXiv. 2023.


Emerging scaling laws of large language model solutions

What are the limits of scale? As a key driver of impact, LLMs rapidly yield higher-quality model instances with an increasing number of training observations. Having roughly 2–20 times more training word tokens than model parameters for model fitting has led to impressive performances on multiple occasions. From a data perspective, it is challenging to get a sense of the upper bound on available text, text-transformed, and text-transformable data. As one concrete consideration, the size of the entire text volume on the internet may today reach around 2 trillion word tokens, based on simple normative assumptions (1.2 billion websites × 1,500 words per website on average [according to ChatGPT query, September 2023]). From a model perspective, from 2018 to 2022, the sizes of LLMs have increased from ∼10^8 (e.g., ELMo, BERT-L) to ∼10^11 (e.g., PaLM) parameters to be estimated. As a first rule of thumb holding across many goals and applications, expanding the depth and width of the model (increasing the number of parameters) leads to clear performance improvements. Knowing how a model scales is of strategic value, as such insights inform decisions on resource allocation: how to prioritize compute budget, data troves, and model size.
More specifically, one comprehensive, widely regarded empirical study in the deep learning literature explored and carefully benchmarked seven orders of magnitude of model scale.[17] These investigators designed computational experiments that successfully converged on three key factors that determine model scaling: (1) the number of model parameters (N), (2) the amount of available data (D), and (3) the amount of available computation power (C) used for model estimation. In stark contrast, in these experiments, the model performance only mildly depended on the actual shape of the model architecture. Overfitting (i.e., over-adjustment to idiosyncrasies in the training data) appeared to be largely prevented by increasing N and D in parallel. In contrast, performance decay resulted if only N or only D was increased (but see Touvron et al.[18]), holding the respective other factor fixed. Finally, continued scale up of N, D, and C displayed patterns of diminishing returns, following a power law.

17. Kaplan, J., McCandlish, S., Henighan, T., et al. Scaling laws for neural language models. Preprint at arXiv. 2020.
18. Touvron, H., Lavril, T., Izacard, G., et al. Llama: Open and efficient foundation language models. Preprint at arXiv. 2023.
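The power-law behavior can be illustrated by fitting a saturating scaling curve to loss measurements across model sizes; the sketch below uses synthetic numbers purely for illustration and assumes SciPy for the curve fit.

# Fitting an empirical scaling curve, loss(N) = a * N^(-alpha) + c, to synthetic data.
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(N, a, alpha, c):
    return a * N ** (-alpha) + c

# Synthetic observations standing in for measured validation losses at different model sizes N.
N = np.array([1e6, 1e7, 1e8, 1e9, 1e10])
loss = scaling_law(N, a=50.0, alpha=0.3, c=1.7) + np.random.default_rng(0).normal(0, 0.01, N.size)

params, _ = curve_fit(scaling_law, N, loss, p0=(10.0, 0.5, 1.0),
                      bounds=([0, 0, 0], [np.inf, 1.0, np.inf]))
a_hat, alpha_hat, c_hat = params
print(f"fitted exponent alpha ~ {alpha_hat:.2f}, irreducible loss ~ {c_hat:.2f}")
# Diminishing returns: each 10x increase in N shrinks the reducible loss by a constant factor.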
However, as a very recent development, rather than continuing the initial trend of increasing model size, LLMs have been shrunk back more and more in terms of parameter number.[18,19] Counterintuitively to many investigators, reducing the model size again, potentially aligning better with the actual amount of available data, boosted the model performance, loosened up memory requirements, and relieved the computational cost. These improvements may turn out to be critical for the applicability of LLM solutions to real-world problems and increase the potential, for example, of smart phones carrying dedicated LLMs in the years to come. In short, a nascent research stream indicates that more data are, relatively, more important than larger model sizes in terms of parameters, while both are driving factors, each by itself.

19. Hoffmann, J., Borgeaud, S., Mensch, A., et al. Training compute-optimal large language models. Preprint at arXiv. 2022.
Of note, measuring model performance critically depends on the investigator’s choice of evaluation metric.[20] These authors argue that changes in LLM behavior deemed “emergent” (abilities that are not readily apparent in smaller-scale models but are present in large-scale models[15]) may become apparent only due to researchers’ choice of certain evaluation metrics. Conversely, the authors[20] also showed that metric choice can induce seeming emergent abilities in diverse architectures and tasks. Hence, recent empirical investigations show[20] that changing metrics can weaken or strengthen signs of emergent abilities in LLM architectures as a function of model scale, with direct implications for AI safety and AI alignment.

20. Schaeffer, R., Miranda, B., Koyejo, S. Are emergent abilities of Large Language Models a mirage? Preprint at arXiv. 2023.
Overall, larger LLMs turn out to be more sample efficient than smaller LLMs in the fine-tuning or few-shot learning scenario. That is, paradoxically, the more model parameters need to be estimated, the fewer input data points are needed to achieve comparable performance. As in data science in general, increased data quality can always lead to further performance gains. Although it is important to acknowledge that neural network scaling laws are still almost entirely empirical at this point, these scaling behaviors show robust trends (but see Caballero et al.[21]). The expansion and explosion of LLM architectures were fueled by (1) the invention of transformers, which tend to vary only slightly across recent LLMs, (2) availability of abundant data sources, and (3) availability of compute power at scale. Of relevance to the next section, the specific architecture of the model (number of layers, layer dimension, etc.) is relatively inconsequential, particularly as the model size increases.

21. Caballero, E., Gupta, K., Rish, I., et al. Broken neural scaling laws. Preprint at arXiv. 2022.

Large language models exhibit unprecedented transfer learning capabilities

For deep learning tools to thrive, there is commonly a need for data abundance. However, many areas of neuroscience do not have massive data troves readily available, let alone the internet-scale kinds of datasets that fuel text and image analysis in the AI community. This discrepancy begs the tactical question: what kinds of abundant non-neuroscience data can be leveraged to port modeling solutions over to revisit and attack neuroscience problems?
Intuitively, “transfer learning” is a mode of data analytics that revolves around storing structured knowledge gained while solving one problem and applying it to a different but related problem. Transfer learning aims to improve performance on a similar, often more constrained, modeling task that is typically (severely) under-resourced. In the context of deep learning, this modeling regime typically refers to the practice of pre-training a model on a massive dataset as a starting point and then applying or refining (“fine-tuning” its model parameters by slight adjustments) this model on a smaller dataset pertaining to a specific task of actual interest (please see https://www.ruder.io/transfer-learning/ for a comprehensive source on fine-tuning techniques for LLMs). This agenda cashes in on the hypothesis that the features learned by the pre-trained model can serve as a general representation, beneficial for the target task. Historically, the success of transfer learning typically depended on a high degree of similarity between the pre-training and fine-tuning tasks (but see next section).
LLMs, and other transformer-carrying architectures, have shown beyond-expectations capability in transfer learning, thus revolutionizing NLP by expanding the scope of executable tasks. As a key inflection point, until recently, the dominant paradigm still consisted of supervised model pre-training on massive corpora. This requirement of large quantities of data points with high-quality annotations was vexing. High-quality labels are typically logistically challenging to obtain—severely limiting what kinds of data available on the internet and other sources could actually be used for effective pre-training and thus transfer learning. It is only now that unsupervised pre-training, which does not require accurate annotations for each data point, came into reach and generated previously unseen performance. This watershed event considerably expands the scope of data usable for pre-training of LLMs.
More formally, the more parameters that need to be estimated in an LLM, the slower the model development process. LLMs opened the door to new regimes of fine-tuning going beyond what pattern-learning algorithms could achieve before. Several approaches have been proposed to adapt a model to a new task while only updating or adding a relatively small number of parameters. One tactic consists of “freezing” (leaving unchanged) the parameters of several layers of a pre-trained LLM. This approach then adapts only a small fraction of adjustable parameters for the downstream task, thus avoiding “(catastrophic) forgetting” of originally extracted knowledge encapsulated in the initial model instance.
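A minimal sketch of this freezing tactic, assuming PyTorch and the Hugging Face transformers library with a standard BERT checkpoint; the choice of which layers to un-freeze is an illustrative assumption.

# Sketch of parameter "freezing": adapt only the top layers plus the task head.
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# 1) Freeze every parameter of the pre-trained backbone.
for param in model.base_model.parameters():
    param.requires_grad = False

# 2) Un-freeze only the last two transformer layers; the new classification head stays trainable.
for layer in model.base_model.encoder.layer[-2:]:
    for param in layer.parameters():
        param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable parameters: {trainable:,} of {total:,}")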
This logic can be extended during fine-tuning by adding new learnable layers within the LLM. Such “adapters”[22,23,24] can considerably reduce training time and compute costs on the target task. The selection of particularly high-quality data for the fine-tuning phase was shown to lead to competitive performance, with even fewer target task examples. LLMs proved remarkable in few-shot learning. At its extreme, zero-shot learning leveraging pre-trained LLMs (using a trained LLM on a new task, without providing examples for that new task) turned out to be proficient at a variety of downstream tasks out of the box, that is, even without adjustment of the pre-trained model.[25,26]

22. Houlsby, N., Giurgiu, A., Jastrzebski, S., et al. Parameter-efficient transfer learning for NLP. PMLR. 2019; 97:2790-2799.
23. Pfeiffer, J., Rücklé, A., Poth, C., et al. Adapterhub: A framework for adapting transformers. Preprint at arXiv. 2020.
24. Bapna, A., Arivazhagan, N., Firat, O. Simple, scalable adaptation for neural machine translation. Preprint at arXiv. 2019.
25. Radford, A., Wu, J., Child, R., et al. Language models are unsupervised multitask learners. OpenAI blog. 2019; 1:9.
26. Brown, T., Mann, B., Ryder, N., et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 2020; 33:1877-1901.

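As a concrete illustration of zero-shot use, the sketch below classifies a sentence against candidate labels supplied only at inference time. It assumes the Hugging Face transformers pipeline with the commonly used facebook/bart-large-mnli checkpoint; the example text and labels are invented.

# Zero-shot labeling: no task-specific training examples, only candidate labels at inference.
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

text = ("After the second dose, the participant reported vivid geometric imagery "
        "and a pronounced sense of time dilation.")
candidate_labels = ["visual effects", "time perception", "mood elevation", "nausea"]

result = classifier(text, candidate_labels, multi_label=True)
for label, score in zip(result["labels"], result["scores"]):
    print(f"{label}: {score:.2f}")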
In short, the monstrosity of LLMs, encompassing billions of adjustable model parameters, unlocked the extraction of quintessential representations from massive text corpora, without the previously acute need for supervised label annotations. Unsupervised deep learning turned out to be much more scalable in practice. Hence, neuroscientists at organizations without the means to train LLMs from scratch can still benefit from state-of-the-art performance by refining already pre-trained models on target tasks of primary interest, with reduced data and compute budget requirements. LLMs can thus better identify deep hidden patterns, relationships, and context within text. This led to the capability of responding to human queries, generating creative novel content, and forming accurate outcome predictions.

Foundation models as computational LEGO bricks

Paradigmatically, LLMs are trained initially on massive text corpora, such as internet content and other public or private sources. This leads the model to develop and instantiate a general internal representation of semantic meaning, even across different languages, including syntax and grammar, although it is a matter of current debate to what extent LLMs develop an understanding of meaning.[20,27,28] Going much beyond that, the model learns an efflorescence of general facts, certain apparent reasoning abilities, and, possibly, a semantic world representation. The evolution of foundation models can perhaps be traced back to last-generation NLP models, before the transformer era (2017-), like Word2Vec[5] and GloVe,[6] which expressed words in continuous vector spaces (cf. introduction), hinting at the universality of spanned semantic spaces.

27. Xiang, J., Tao, T., Gu, Y., et al. Language Models Meet World Models: Embodied Experiences Enhance Language Models. Preprint at arXiv. 2023.
28. Berglund, L., Tong, M., Kaufmann, M., et al. The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A". Preprint at arXiv. 2023.
By distilling and assimilating the quintessence from disparate expansive sources, a general-purpose representation is formed that encapsulates vast, compact, and dense human knowledge, as a form of prior for downstream modeling. It is not just memorization but information extraction and structurization. Philosophically, such a successful compression of information can mark a milestone toward refined predictions, in that successful prediction indicates a form of information compression. Similar to a shared infrastructure or platform, such an AI engine can then act as the bedrock on which a variety of tasks can be built, making many quantitative modeling workflows feasible, efficient, and scalable. These “LEGO bricks” can be thought of as foundations because many downstream applications can be constructed on top of them, like stacking building blocks. This fresh attitude to quantitative modeling is the strict opposite of training specialized models for deployment in narrow tasks.
It is possible to use thousands of GPUs training an LLM for weeks on trillions of word tokens, with a result that can be stored and deployed on a smartphone. As a crucial consequence for the future, foundational modeling frameworks provide universal computational units that will potentially democratize access to high-quality AI solutions across broad categories of investigators. This is all the more important for the neurosciences, because investigators tend to operate on smaller datasets than those in the core machine learning community. Similarly in biological research, even the Human Cell Atlas project has produced gene expression data for “only” ∼40 million human cells from ∼6,000 donor individuals at the time of this writing.
Bold innovation will emerge from creative ideas on how to put these baseline operation systems to use, to revisit and tackle classic research questions—applications that were categorically unthinkable and infeasible before current NLP technology. Enabling researchers across diverse domains to bootstrap common building blocks may also help boost comparability across studies and foster collaboration across teams, institutions, and geographies. The fruit of deep learning breakthroughs will be increasingly accessible in always more resource-constrained settings. It is likely that foundation models will change the face of bioinformatics in neuroscience and biomedicine in the near future.

Large language models for biological sequences

The inductive biases of LLM learning engines immediately appear appropriate not only for word sequences, but also for different kinds of biological sequences, presenting many unexplored opportunities. The human genome encodes for ∼20,000 genes, the segments of DNA that form the basis for protein synthesis in cells of the brain and other body parts. A natural proving ground, with direct relevance to the neurosciences, is the “central dogma of biology:” the one-directional flow of genetic information from (1) nucleotide sequences in DNA to (2) base sequences in messenger RNA to (3) amino acid sequences in protein products.
As a principal goal, geneticists wish to map this progression of genetic information, to link alterations in the DNA sequence itself to corresponding functional outcomes. To that end, Meta AI has presented a protein language model (Figure 3) that predicts phenotypic consequences from differences in genetic variants.[29] A 650-million-parameter model was used to infer the totality of ∼450 million possible missense variant effects in the human genome—each a switch in a single DNA nucleotide that leads to an amino acid swap in the downstream protein (pathogenic or benign). Such variants in DNA gene encoding are of special interest since they entail protein alterations that can be linked to disease mechanisms and possible therapeutic targets. Such approaches enable an exhaustive profiling of protein-disrupting damaging variants across the entire genome in humans and other organisms.

29. Brandes, N., Goldman, G., Wang, C.H., et al. Genome-wide prediction of disease variant effects with a deep protein language model. Nat. Genet. 2023; 55:1512-1522.
Figure 3 Protein language models to predict the functional consequences of genetic variants
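The log-likelihood-ratio idea behind such variant-effect scoring can be sketched as follows. The snippet assumes the fair-esm package and its pre-trained ESM-2 650M checkpoint; the sequence and the chosen substitution are toy examples, and the published scoring recipe differs in detail.

# Schematic variant scoring with a protein language model: compare the model's
# probability of the mutant versus wild-type amino acid at one position.
import torch
import esm

model, alphabet = esm.pretrained.esm2_t33_650M_UR50D()
model.eval()
batch_converter = alphabet.get_batch_converter()

wild_type = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"    # toy sequence for illustration
position, mut_aa = 10, "P"                        # hypothetical missense substitution
wt_aa = wild_type[position]

_, _, tokens = batch_converter([("protein", wild_type)])
with torch.no_grad():
    logits = model(tokens)["logits"]              # (1, seq_len + special tokens, vocab)
log_probs = torch.log_softmax(logits, dim=-1)

# Token 0 is a beginning-of-sequence marker, so residue i sits at index i + 1.
score = (log_probs[0, position + 1, alphabet.get_idx(mut_aa)]
         - log_probs[0, position + 1, alphabet.get_idx(wt_aa)])
print(f"{wt_aa}{position + 1}{mut_aa}: log-likelihood ratio = {score.item():.2f} "
      "(more negative suggests a more damaging substitution)")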
Can we automatically derive insights into the underlying cellular states and active biological pathways from RNA transcript expression data alone? At the level of single-cell RNA gene expression, an LLM was trained on 10 million cells (Figure 4), each cell containing expression values for a fraction of the approximately 20,000 human genes.[30] As a seminal example of a foundation model (cf. above), gene sets are modeled as making up biologically meaningful processes, analogous to how word sets make up meaningful sentences in language. By ingesting a mass of gene expression patterns, the model formed an internal representation of general principles of gene-gene relations and gene-cell relations. In addition to gene-specific tokens, special tokens were introduced to denote meta-information such as cell type, data batch, and experimental conditions like perturbations of signaling pathways and techniques used for RNA transcript sequencing. The authors also abolished the need for the input to be a sequence: they designed a mission-tailored attention mechanism to get a tight grip on cohesive co-occurrence regimes of expressed genes, akin to auto-regressive generation, based on iteratively predicting expression of new genes in sets, akin to the next-word prediction goal in sentence sequences. That is, the authors recast the inductive bias from the usual sequence-of-words-in-a-sentence logic into a bag-of-genes-in-a-cell logic to avoid a strict sequence requirement. Once established, the trained foundational LLM could then be fine-tuned and deployed with performance gains in a variety of different downstream tasks, including nuisance batch correction, cell type annotation, and prediction of targeted perturbation conditions. Such approaches show potential for leveraging self-supervised learning techniques to grasp complex single-cell mechanisms and use the ensuing internal embedding representations for integration across different organs and species.

30. Cui, H., Wang, C., Maan, H., et al. scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI. Preprint at bioRxiv. 2023.
Figure 4 Creating a foundation model of the “grammar” of transcriptome biology from exponentially growing single-cell genomics data
Going from the gene level to the level of 3D protein structure requires prediction of the ultimate 3D configuration from 1D amino acid sequences alone. The “protein folding problem” revolves around how information in our DNA compresses information about final protein forms. With >200 million protein structures in the database, AlphaFold2[31] is based on LLMs to capture protein sequence interactions between amino acid residues that are far away from each other along the protein backbone. In a brute-force shotgun learning approach, the authors showed that 1D sequence information does contain key information necessary to understand the complex process of how proteins actually fold in nature. At the protein-to-function level,[32] investigators trained 700-million-parameter 34-layer transformer models on 86 billion amino acids across 250 million protein sequences (UniParc database). The model-internal embedding representations were gleaned from just the sequence information itself. The trained model was found to instantiate knowledge relevant to the ultimate protein’s biochemical properties, elements of morphological structure in vivo, contact sites, and biological activity.

31. Jumper, J., Evans, R., Pritzel, A., et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583-589.
32. Rives, A., Meier, J., Sercu, T., et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. USA. 2021; 118, e2016239118.
Taken together, capturing long-range interactions (i.e., tokens far apart from each other in the input sequence) turns out to be valuable to derive meaningful general principles not only in word sequences but also in different biological sequences. Nature appears to harbor underlying general rules that can now be exploited for extrapolating beyond actual sequence elements (e.g., nucleic acid, gene expression, amino acids) in the service of next-generation computational biology. The learned sequence embeddings can be used for various downstream research goals, including quality control procedures, grouping of biological entities, and enhancing phenotype predictions.
Moreover, LLMs serve as a platform that now enables advanced in silico models of the central dogma of biology (going from the DNA double helix to gene transcript expression to fully formed proteins). That is, once the LLM can accurately approximate the target system, reiterating trusted observations from previous rigorous experiments, investigators will be able to interrogate the LLM to extract new molecular insights about the target system and to identify broader driving biological mechanisms. We caution against drawing overly strict parallels between semantic language systems and molecular biology systems, given notable differences. Nevertheless, in the future, LLMs are in a unique position to help generate new sequences, never observed in the wild, that are biologically active.

Large language models for automated annotation

Neuroscience research often relies on accurate annotation for data elaboration, designing experiments, or interpretation of results. A recent study using classical NLP[33] explored links between brain response signals of subjects watching the movie Forrest Gump and the evolution of the movie story, that is, the constituent semantic facets that make up the film narrative (Figure 5). This study serves as a prime example of research that depends on pertinent high-quality annotations of data points. Regarding the brain recordings collected from the studyforrest database (https://www.studyforrest.org/data.html), 3,000 individual images of whole-brain neural activity were acquired from each of 15 subjects as they watched the story in the 2-h film unfold (25 such images per minute). To enrich the provided dataset, all of the scenes throughout the movie were enriched by computationally derived meta-information. To this end, text data were obtained from previously underexploited sources: time-locked subtitles and an auditory-only narrated version of the film oriented toward blind audiences that describes the events and scenes in the movie—the starting point for NLP-enabled data augmentation.

33. Yang, E., Milisav, F., Kopal, J., et al. The default network dominates neural responses to evolving movie stories. Nat. Commun. 2023; 14:4197.
Figure 5 Multi-modal brain-text integration using NLP based on movies
The combined scene-by-scene text information of Forrest Gump was captured as a bag-of-words matrix—the set of all unique words with their frequencies in a given time slice accompanying the entire movie. Latent semantic analysis was then used to deconvolve the scene-wise word statistics into unique semantic dimensions, to capture the underlying meaning and recurring themes in the story line. In parallel, in a classical top-down approach, human annotators (a group of students) manually attached tags to scenes by choosing among a set of 52 pre-defined “indicators” from the audiovisual version of the movie. These choices were based on the scenes’ emotional content, circumstances, and other aspects, predefined a priori to be relevant, based on existing knowledge. This classical approach, which emphasized (for example) the detailed characterization of human emotions based on natural subjective experiences of human observers, turned out to miss important nuances that the text-derived semantic meaning representations reflected well, showing the potential for future LLM approaches in naturalistic neuroscience.
Going beyond the status quo of manual annotation, the NLP approach (latent semantic analysis) enabled decomposition of the story into 200 semantic movie contexts, each with their scene-by-scene relevance. As a complement to human-derived emotion annotations, the semantic contexts provided a means to track the occurrence of characters (e.g., Lieutenant Dan), contexts (e.g., war), and scene properties (e.g., day versus night). Analysis of the integrated data revealed empirical connections between brain states and specific elements, concepts, and themes within scenes.[33] Hence, algorithmically derived semantic facets were more successful in combined movie-brain-text analysis than traditional approaches that rely on human a priori intuition to determine which aspects should be most important.
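A minimal sketch of this annotation pipeline, with invented scene descriptions standing in for the movie-derived text and scikit-learn supplying the bag-of-words and latent semantic analysis steps:

# Scene-wise text -> bag-of-words matrix -> latent semantic dimensions.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import TruncatedSVD

scene_texts = [
    "forrest runs across the football field",
    "lieutenant dan shouts during the storm at sea",
    "jenny and forrest walk home from school",
    "soldiers march through the rain in vietnam",
]

counts = CountVectorizer().fit_transform(scene_texts)   # scenes x unique words
lsa = TruncatedSVD(n_components=2, random_state=0)      # latent semantic analysis
scene_loadings = lsa.fit_transform(counts)              # scene-by-scene relevance of each context

print(scene_loadings.shape)   # (4 scenes, 2 semantic dimensions)
# Each column can then be related, time point by time point, to the brain recordings.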
LLMs present an opportunity to carry over knowledge and concepts from other areas of human activity into the process of how scientific research is conducted today. Pipelining annotation generation could dramatically enhance our capabilities to scale complex manual protocols such as those employed in image and video data, as in the study detailed above, but also many other forms of stimulus material, such as electronic health records, voice recordings, or the biometric outputs captured by wearable devices. Various other kinds of neuroscience-related data sources could be directly combined with brain signals on a single subject basis or at the group level. Historically, annotation of these data forms has required input from human experts, either directly or indirectly through tools dedicated to specific end-to-end input-output learning, such as neural networks trained to discern human emotions directly from visual data, or an electronic olfaction device engineered to estimate the subjective appeal of scent compounds based on their physical characteristics.[34]

34. Ye, Z., Liu, Y., Li, Q. Recent Progress in Smart Electronic Nose Technologies Enabled with Machine Learning Methods. Sensors. 2021; 21, 7620.
There are certain caveats associated with manual annotation in general, several of which LLMs can mitigate, including (1) high logistic and financial cost of manual effort, (2) ontological limitations of categorization systems used to derive annotation tags, (3) subjectivity from human annotators and subjectivity-based data, and (4) limited reproducibility.
Ultimately, as indicated above, due to the high cost required for their procurement, manually annotated vision and language datasets are relatively rare and often small in size (10,000–100,000 data points). In response to previous annotation data scarcity, numerous works[35,36,37] automatically scrape readily available paired vision-text data from the internet and other general-purpose sources. Now similar feats to those achieved in the image-text annotation domain can be achieved in text-text annotation scenarios. With LLMs, annotations can be automatically generated after model pre-training on a variety of data relevant to the annotation task at hand. As a hypothetical example, a biotechnology company is interested in tagging first-hand accounts of psychoactive drug experiences with labels indicating different subjective effects; pairs of such accounts and manually applied subjective effect tags can be used for fine-tuning of a foundation model employed by the company. Alternatively, LLMs such as GPT-4 can be prompted to perform this task without any additional training data, based on the assumption that their training sets provide enough context to discriminate between different subjective effect terms and examples thereof.

35. Alayrac, J.-B., Donahue, J., Luc, P., et al. Flamingo: a visual language model for few-shot learning. Adv. Neural Inf. Process. Syst. 2022; 35:23716-23736.
36. Sharma, P., Ding, N., Goodman, S., et al. Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018.
37. Thomee, B., Shamma, D.A., Friedland, G., et al. YFCC100M: The new data in multimedia research. Commun. ACM. 2016; 59:64-73.
Phrases and sentences, like single words, can be automatically assigned information-rich semantic embeddings; the same is true for automatically (or manually) obtained annotations. Conversion of freeform textual information via LLM “encoders” to structured embedding vectors enables continuous quantification of discrete semantic elements. In a complementary manner, LLM “decoders” serve to transform embeddings back into language text. Preprocessing natural language as embeddings unlocks the door to new methodologies for probing correlations between distinct linguistic patterns and neural activities. Associating natural language data with neurological measurements is a step toward profound comprehension of the generation, perception, processing, and interpretation of language by the human brain. The quantitative representation of natural language text is the industry-standard intermediate form used in computational analysis, implying reproducibility and potential for more tunable and scalable augmentation. Language, serving as a tool to encapsulate information derived from the five human senses, affords the quantified representation of a diverse range of phenomena within human experience.
Once again touching on the world of image auto-annotation as a source of inspiration for text-annotation tasks, the tool RETfound is an innovative approach that addresses the image-to-text problem in the medical domain.[38] RETfound is a foundation model for labeling widely available retinal images with disease categories. It is designed to expedite diagnosis of diseases including cataract, central serous retinopathy, diabetic retinopathy, glaucoma, heart failure, macular dysfunction, myocardial infarction, Parkinson’s disease, stroke, and macular degeneration. The model architecture is based on the large vision transformer framework: an encoder is used to generate a high-resolution embedding space that can be used to differentiate between retinal image features, analogous to embeddings used by LLMs as a way of encoding semantics in natural language text.

38. Zhou, Y., Chia, M.A., Wagner, S.K., et al. A foundation model for generalizable disease detection from retinal images. Nature. 2023; 622:156-163.
RETfound’s decoder is used for image reconstruction, while the encoder is used to derive features for fine-tuning toward downstream disease prediction tasks. RETfound was pre-trained on 1.6M unlabelled retinal images via self-supervised learning—a paradigm where AI models learn to find patterns within a dataset without any additional training information. For example, if a neural network was trained in a self-supervised learning task using a training set consisting of pet images, the model would most likely learn to recognize shapes that correspond to cats, dogs, and other popular pets. The model knows how to distinguish between images of different types of pets, but it does not “know” that we call one group “cats” and other information that might be linked to the pets in the images. The same is true for RETfound in its pre-fine-tuned state: it can distinguish between distinct variations seen in retinal scan images, and this ability allows it to then be fine-tuned for particular disease detection tasks.
This fine-tuning was performed with specific expert-provided labels from a number of different datasets ranging in size. For example, the “OCTID” dataset, containing 470 retinal scans used to label conditions such as “normal,” “macular degeneration,” and “diabetic retinopathy,” and the Moorfields Eye Hospital-AlzEye dataset, which links ophthalmic imaging to the health records of 353,157 patients who attended this hospital between 2008 and 2018, were used for fine-tuning to orient RETfound toward wet-AMD prognosis.[39] With such comprehensive training, RETfound can be used to create text descriptions of retinal images based on predictions made in the context of pixel patterns in images from records generated by medical professionals. Hence, models like RETfound are designed to alleviate the annotation workload of experts, serving as inspiration for conceptual frameworks employing LLMs for similar purposes.

39. Wagner, S.K., Hughes, F., Cortina-Borja, M., et al. AlzEye: longitudinal record-level linkage of ophthalmic imaging and hospital admissions of 353 157 patients in London, UK. BMJ Open. 2022; 12:e058552.
Image formats can be used to capture the physical world, on the one hand, and activity of neurons in the brain, on the other hand. Alternatively, they can serve as experimental variables such as pictures used in experiments depending on visual stimuli to explore links between brain scans and such stimuli. In contrast to image formats, chemical structures and descriptors thereof can capture key aspects of brain chemistry, neurophysiology, neuropharmacology, and chemosensory stimuli. Simplified molecular-input line-entry system (SMILES) is a method of description for representing and semantically re-expressing chemical structures as text-based objects (Figure 6). SMILES was first conceived based on the principles of molecular graph theory to represent chemical structure with rigorous specification in a way that was well suited for machine processing.[40]

40. Weininger, D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 1988; 28:31-36.
Figure 6 Examples of paired chemical structures and SMILES sequences
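As a minimal illustration of SMILES as machine-readable text, the sketch below parses two common textbook SMILES strings with RDKit (an assumed, freely available cheminformatics library), canonicalizes them, and splits them into tokens of the kind a language model could consume; the regex tokenizer is a simplification, not a production scheme.

```python
# Sketch: SMILES strings are plain text that can be canonicalized and tokenized like
# any other sequence fed to a language model. RDKit is an assumed dependency, and the
# regex tokenizer is a simplified illustration rather than a production scheme.
import re
from rdkit import Chem

molecules = {
    "caffeine": "Cn1cnc2c1c(=O)n(C)c(=O)n2C",
    "dopamine": "NCCc1ccc(O)c(O)c1",
}

SMILES_TOKEN = re.compile(r"Cl|Br|\[[^\]]+\]|.")  # two-letter atoms, bracket atoms, or single characters

for name, smiles in molecules.items():
    mol = Chem.MolFromSmiles(smiles)        # parse the text into a molecular graph
    canonical = Chem.MolToSmiles(mol)       # re-emit one canonical text representation
    tokens = SMILES_TOKEN.findall(canonical)
    print(name, canonical, tokens[:12])     # "words" that a sequence model could ingest
```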
There are vast bodies of scientific literature containing chemical names, sometimes appearing in standardized form but oftentimes not. With the proper collection, curation, and integration strategy, a corpus combining chemical names and SMILES structures could be prepared to train an LLM or fine-tune a foundation model for the purpose of exploring potential predictive relationships between chemical structure and semantic content. If this can be achieved, going even one step further, the common embedding space could be connected to a generative model that outputs chemical structures on the basis of text inputs (e.g., “I would like to see novel chemical structures that will be able to enter the human CNS, please!”). In a not-so-distant future, such multi-modal LLMs could become a valuable partner to scientists to enhance the creative process of generating entirely new molecules with targeted properties, whether they be physical, chemosensory, or pharmacological.
Another potential use of a common embedding space between SMILES and natural language would be to analyze mixtures of chemicals as opposed to single chemicals. Just as interpretation of words and phrases appearing in natural language can be significantly influenced by their context, perception of odorant molecules present in chemosensory stimuli (naturally encountered as mixtures) is influenced by the combinations and concentrations of other mixture constituents. Furthermore, small molecules such as neurotransmitters, hormones, drugs, and toxins often act in tandem with their metabolites, impurities, and other biomolecules. These combined elements can exert biochemical and physiological effects in their surroundings, such as binding to target receptors or modulating signal transduction pathway activity. Hypothetically, the common latent embedding space of an LLM trained with SMILES and natural language could be used to navigate the complex, context-dependent multiplicity of action by chemicals and mixtures, of direct relevance to neuroscience.
Another issue presented by annotation, separate from its high cost, is that annotation relying on a predetermined ontology or classification system will be limited by the descriptive capability of that system. Typically, individuals who perform annotation tasks must be trained to properly use a given ontology for applying classifications to data points, as an attempt to mitigate the known challenges of inter-rater variability. Sometimes the training required to properly annotate data is extensive, and the annotators must be qualified as subject matter experts as opposed to lay persons. The embeddings generated by LLM encoders can potentially be "translated" to a set of terms within a targeted ontology using techniques like semantic similarity measurement or clustering.
If left untranslated, LLM embeddings offer a high level of semantic granularity that is not afforded by ontological classification. This specificity is valuable whenever researchers are interested in recording distinct outcomes, because it allows annotations to be categorized in whatever way is most relevant to the specific experiment at hand. As a straightforward hypothetical example (sketched below), one could (1) generate semantic embeddings from annotation labels or another experimental variable recorded via text, (2) generate embeddings from terms present in targeted ontologies, and then (3) calculate the cosine distance between the two sets of embeddings to identify the "nearest neighbor" ontology term for each text-based experimental variable. While such an approach might not afford the accuracy provided by subject matter experts, what it lacks in resolution it compensates for with objectiveness and operational consistency, increasing both the scalability and the reproducibility of annotation. On the other hand, the embeddings yielded by LLMs also provide researchers with a means to analyze annotated datasets, via clustering or more sophisticated techniques, to identify new classification systems.
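A minimal sketch of this three-step recipe is shown below, assuming the sentence-transformers library as a stand-in for an LLM encoder; the model name, annotations, and ontology terms are illustrative placeholders.

```python
# Sketch of the three-step recipe: embed free-text annotations, embed ontology terms,
# and map each annotation to its nearest ontology term by cosine similarity.
# The model name, annotations, and ontology terms are illustrative placeholders.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")   # any sentence-level encoder would do

annotations = ["actor looks tearful and withdrawn", "fast-paced chase through a market"]
ontology_terms = ["sadness", "fear", "excitement", "calm"]

A = model.encode(annotations, normalize_embeddings=True)     # (n_annotations, dim)
T = model.encode(ontology_terms, normalize_embeddings=True)  # (n_terms, dim)

similarity = A @ T.T                    # cosine similarity, since rows are unit-norm
nearest = similarity.argmax(axis=1)     # closest ontology term for each annotation

for text, idx in zip(annotations, nearest):
    print(f"{text!r} -> {ontology_terms[idx]}")
```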
Ideally, we will soon be able to assign expert-grade annotations by means of LLMs, even in the absence of close collaboration with a card-carrying domain expert. To make matters more interesting, once it has been demonstrated that LLMs can apply pre-existing ontologies for annotation in a manner comparable or superior to the performance of experts, we can turn to "expert LLMs" to help with the identification and validation of new terms and ontologies derived in a data-driven way. We can also examine the results of LLM-based annotation to challenge incumbent classification systems that were designed from limited heuristics. While rule-based solutions operate on explicit predefined criteria, black-box AI solutions (despite their opaque decision-making processes) often excel in handling vast and complex datasets,[41,42] achieving superior predictive accuracy where traditional methods might struggle. Using LLM-assisted annotations as a complementary approach to supplement legacy top-down (e.g., manual categorization by domain specialists) and standard rule-based (e.g., predefined algorithms for data point classification) solutions is one way to simultaneously leverage the knowledge that comes from expert experience and the new insights we can obtain from LLMs, letting the data truly "speak for themselves."
LLMs have been described as chameleons (https://karpathy.ai/lexicap/0215-large.html) or as enabling forms of "role play."[43]
They can take on the personality and adopt the thought or writing style of known persons or categories of persons with specific traits, such as Charlotte Brontë, Carl Sagan, or a neuroscientist. This capacity can be leveraged in a variety of ways. In some annotation tasks, it is beneficial to seek counsel with a panel of experts across multiple disciplines, as opposed to a panel of evaluators who all share the same background. Several LLMs assuming different “chameleon” stances could be used in parallel in an annotation task, analogous to a panel of human raters. The LLMs can be asked to take the position of different experts, personality types, professions, age groups, and cultural backgrounds. LLMs not only address the problems presented by the influence of individual subjectivity on annotation tasks, but they also simultaneously enable expression and manipulation of such subjectivity. LLMs can eliminate the transient fluctuations of ephemeral emotional states experienced by human annotators, and if called for they can introduce them in a controlled and repeatable way.
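A hedged sketch of such an "annotation panel" is given below. The complete() helper is a hypothetical placeholder for whichever chat-completion backend is available (a hosted API or a local model); the personas, label set, and majority-vote aggregation are likewise illustrative choices, and a temperature of 0 would keep each panelist's answers repeatable.

```python
# Sketch of an LLM "annotation panel": the same snippet is labeled under several personas
# and the labels are aggregated. complete(system, user) is a hypothetical wrapper around
# whichever chat-completion backend is available; run with temperature 0 for repeatability.
from collections import Counter

PERSONAS = [
    "You are a clinical psychologist annotating emotions in film scenes.",
    "You are a film critic annotating emotions in film scenes.",
    "You are a layperson in their twenties annotating emotions in film scenes.",
]

def complete(system: str, user: str) -> str:
    """Placeholder for a chat-completion call (hosted API or local model)."""
    raise NotImplementedError

def panel_annotate(snippet: str, labels: list[str]) -> str:
    prompt = (f"Scene description: {snippet}\n"
              f"Answer with exactly one label from: {', '.join(labels)}.")
    votes = [complete(system=persona, user=prompt).strip().lower() for persona in PERSONAS]
    return Counter(votes).most_common(1)[0][0]   # majority vote across the panel

# Usage, once complete() is wired to a real model:
# panel_annotate("Forrest waits alone at the bus stop in the rain", ["sad", "hopeful", "neutral"])
```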
There are many sources of inconsistency in the language used to describe neuroscience research as well as subjective experience. These vagaries fuel disagreement between researchers in the interpretation of annotations. The universal format of coherent semantic embedding spaces enables the capture and manipulation of vague or subjective language. Crucially, these representations are exactly repeatable across laboratories and other contexts of research and analysis, so long as the same LLM is used for the same task, with the same totality of previously estimated model parameters. From a practical standpoint, this feature should have a significant effect on the shareability of annotated data across investigators or labs, hopefully expanding the breadth and depth of downstream applications for auto-annotation of datasets via LLMs.
Separate individuals can annotate the same data differently, and even a single annotator’s responses may vary over time. LLMs offer a more stable and consistent form of annotation because they are trained on a broad dataset and are not influenced by subjective experiences. In place of the subjectivity that influences human performance on manual annotation tasks, LLMs have a nuanced mapping of linguistic context captured by the use of language in their training corpora. Off the shelf, LLMs can be thought of as approximations of the average mind of all internet users, "crowd-sourcing thought," since a large portion of their training corpora is derived from the internet. Alternatively, in the event that a foundation model does not appear to capture enough nuance to achieve a specific task, it could be fine-tuned to approximate the average mind based on a certain subset of websites or internet users.
There are typically subjective aspects to the process of manual annotation, especially when the object being annotated is itself experienced subjectively. In the example above, where students manually annotated scenes from Forrest Gump, they were asked to annotate the emotions they perceived to be expressed by the actors in the film. This task requires subjective interpretation of the emotions portrayed in the movie in the first place, on top of the fact that emotional experience is highly subjective in nature. The studyforrest dataset also includes annotations of the physical location in which each scene takes place.[33]
Even though these annotations (“night” vs. “day,” “inside” vs. “outside”) are largely objective judgments made by subject matter experts (two individuals with academic background in film), there is still room for subjective interpretation by the annotators, as exemplified by the operational definition of “day” as any scene that was illuminated by sunlight, as opposed to some other determining factor.
LLMs enable reconciliation between the world of subjective phenomena and objective measurement. The representation of semantic entities via LLM embeddings preserves the discrete subjective or contextual meaning in text such that it can be compared in a consistent way with other text. For example, imagine a scenario where sentences are collected from social media posts to be auto-annotated with labels indicating emotion to be used in a training set for an NLP model that predicts the emotion of social media users from their posts. No matter how unique each envisioned sentence is, the distance between their embeddings and the embeddings of terms such as “enthusiastic,” “depressed,” “nostalgic,” or “peaceful” can be calculated in a uniform fashion. Due to the fact that LLM training corpora capture a large volume of text describing subjective phenomena, more stable and consistent annotations yielded by LLMs can readily be used to characterize subjective experience-based data elements, without using subjective human judgment as part of the annotation process.
The use of LLMs to automate annotation tasks is not a stepwise improvement; it is a next-generation approach that can disrupt a mainstream practice that would otherwise continue to fall prey to subjectivity and other forms of idiosyncrasy. For instance, consider the task of annotating emotions in a collection of diary entries. If given to a group of human annotators, one might label a passage as “sad” based on their personal experiences and cultural background, while another might see it as “reflective” or “nostalgic.” Because LLMs are autoregressive, state dependent, and have hyper-parameters such as temperature (cf. previous section “data science perspective on large language model solutions”), there are reasons why the answers for an identical prompt may not necessarily be exactly the same. Nonetheless, if experimental conditions are held constant, answers from LLMs should be largely restricted to narrow regions of semantic space. In this way, the LLM may offer a level of objectivity and consistency that human annotators, with their inherent subjectivity and idiosyncrasies, simply cannot match.

Large language model for text summarization and knowledge integration

The wide-ranging field that is neuroscience touches various disciplines from physics to psychology. This wildly interdisciplinary field produces a myriad of rather separate experimental findings that can be overwhelming to integrate by human effort alone. Moreover, the breadth of the field often results in researchers working within a particular sub-community, focusing on narrowly specialized research areas, and potentially missing out on opportunities for cross-fertilization with other sub-disciplines. There may also be certain tasks that go beyond human cognitive ability, such as reading experimental results that contain immense numbers of datapoints or distilling the content of all major scientific publications from the past year. LLMs can help researchers absorb large bodies of text that would otherwise be challenging to read in a reasonable amount of time.
The capabilities of LLMs extend beyond typical text summarization tasks, where the text being gathered is presented as human-readable (albeit lengthy) natural language. LLM embeddings provide an objective quantification of subjective text (cf. previous paragraph) to resolve linguistic ambiguities and standardize outputs. Subjectivity-based text could be simple words or phrases, such as those used to capture emotions portrayed by actors in Forrest Gump[33] or those used to describe chemosensation of odor or flavor compounds.[44] Or it could be far more complex, as is the case for text used in psychedelic research.
The common expression "the psychedelic experience" is used in a way that implies a uniformity across "trips." In actuality, the psychedelic experience is full of nuance and variation rooted partially in the drug user's set and setting as well as partially in psychopharmacological differences between drugs. Understanding the underlying factors determining nuanced outcomes observed in psychedelic drug users should help us to understand whether certain drugs or varieties of subjective effects can be employed to treat specific conditions, just as the different experiences provided via ingestion of psilocybin and MDMA have each shown early success in the treatment of OCD and PTSD, respectively. To investigate such nuances, a recent study used NLP techniques to analyze 6,850 "trip reports" from psychedelic drug users (Figure 7). The objective of the study was to draw connections between subjective experiences, 27 qualitatively distinct drugs, and a set of 40 associated neurotransmitter receptors expressed in the human brain.[45]
The results of this study include detailed word lists of ranked relevance for semantic dimensions capturing major themes present in experience reports, derived via canonical correlation analysis (CCA).
Figure 7 Multi-modal receptor-text integration using NLP to reveal the mechanistic basis of psychedelic drug experience
Human interpretation of the complex themes captured by thousands of words in a particular order is quite difficult. Each word in the ranked lists provided via CCA carries its own potential for subjective interpretation. The varied range of potential interpretation is further widened by the context provided by neighboring terms, as well as the transition in general meaning captured by different subsections of the lists (i.e., top 1 percent versus top 5 percent). Despite the results being presented as dense lists of highlighted words, an LLM can seamlessly abstract away from these word sets by extracting semantic core themes from them, deriving shared higher-level categories of subjective effects elicited by psychedelic drugs. These higher-level categories can then be leveraged to interrogate new hypotheses for drug discovery platforms and experimental treatment approaches, in search of new psychedelic drugs with targeted subjective effects intended to treat specific conditions. Future use of LLMs highlights yet another opportunity for researchers to glean insights from complex, unstructured data that humans might find challenging to cope with.
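For readers unfamiliar with the analysis style, the sketch below shows the bare mechanics of CCA on synthetic data using scikit-learn: a text-derived matrix and a receptor-affinity matrix, one row per report, are projected onto paired canonical axes whose term weights can then be inspected. The shapes and variable names are illustrative and do not reproduce the published pipeline.

```python
# Bare mechanics of canonical correlation analysis (CCA) on synthetic stand-ins for the
# two data blocks: a text-derived matrix and a receptor-affinity matrix, one row per
# trip report. Shapes and names are illustrative, not the published study's pipeline.
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
n_reports, n_terms, n_receptors = 500, 300, 40

X_text = rng.random((n_reports, n_terms))          # e.g., term or topic loadings per report
Y_receptor = rng.random((n_reports, n_receptors))  # e.g., receptor affinity profile of the drug taken

cca = CCA(n_components=5)
X_scores, Y_scores = cca.fit_transform(X_text, Y_receptor)   # paired canonical variates

# Terms with the largest absolute weights on the first canonical axis are candidate
# "theme" words of the kind an LLM could later summarize into higher-level categories.
top_terms = np.argsort(np.abs(cca.x_weights_[:, 0]))[::-1][:10]
print("indices of top-weighted terms on axis 1:", top_terms)
```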
Medically oriented LLMs, such as PMC-LLaMA (built on Meta's LLaMA),[46] offer a promising solution to the need for sifting through extensive text sources to aggregate and synthesize the essence of their meaning and informative value. By gathering and summarizing vast information landscapes, these models provide access to the quintessence, and perhaps elements of understanding, of complex topics. Specifically, PMC-LLaMA was designed to support individuals in navigating vast swaths of medical information by training on a massive corpus: 4.8M biomedical academic papers, 30K medical textbooks, as well as 202M tokens of medical question-answer pairs, rationales for decision making, and conversational dialogues. PMC-LLaMA was shown to produce reasonable and coherent responses in zero-shot assessments of medical knowledge prompts, for example, answering questions from patients about their urinary tract infections and in-depth exam questions about microbiology and pharmacology. When asked a multiple-choice question about a drug-drug interaction involving tuberculosis and hormonal birth control medications, PMC-LLaMA correctly indicated the mechanism of the drug-drug interaction and elaborated on the rationale used to arrive at the answer (CYP3A4 induction by the antibiotic rifampin leads to decreased concentrations of hormonal birth control, ultimately increasing the possibility of an unintentional pregnancy). PMC-LLaMA underscores the effectiveness of data-centric approaches in specialized domains and the value of domain-specific model tuning.[46]
Such impressive responses to prompt queries represent a scenario of machine-assisted human intelligence in which LLMs can be tailored to effectively educate users in specialized areas, highlighting the potentially far-reaching societal impact of such models and the importance of domain-specific model development.
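A minimal sketch of how such a medically tuned model might be queried in a zero-shot fashion with Hugging Face transformers is shown below; the model identifier is a placeholder rather than a verified checkpoint name, and the prompt wording is illustrative.

```python
# Sketch of a zero-shot multiple-choice query against a medically fine-tuned causal LLM
# via Hugging Face transformers. MODEL_ID is a placeholder, not a verified checkpoint
# name, and the prompt wording is illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "path-or-hub-id-of-a-medical-llm"   # e.g., a locally downloaded PMC-LLaMA checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

prompt = (
    "Question: A patient taking combined oral contraceptives starts rifampin for tuberculosis. "
    "What is the main interaction risk?\n"
    "A. Increased estrogen toxicity\n"
    "B. Reduced contraceptive efficacy\n"
    "C. QT prolongation\n"
    "D. No interaction\n"
    "Answer:"
)

inputs = tokenizer(prompt, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=64, do_sample=False)  # greedy decoding
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```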
As another concrete example of gains in everyday life, instead of plowing through a series of thick textbooks, a medical student preparing for an exam could query models like PMC-LLaMA for information on specific topics to cover a wide range of material in a more time-efficient way. Just as automation in industry made more time available for workers to accomplish other tasks, we can expect to see similar opportunities presented by LLM-based developments. However, the improvements will not all be simply life enhancing; many applications, such as an interactive LLM with access to patient electronic health records, could potentially be lifesaving. Unfortunately, a recent statistical investigation by Rodziewicz et al.[47] estimates that ∼400,000 hospitalized American patients experience some type of preventable harm each year, with roughly a quarter of such cases resulting in death. The lifesaving potential of AI in medicine can shine in several areas, such as (1) reducing the workload of medical professionals, so that they can use their time more efficiently to assess and treat their patients, and (2) acting as an early warning system that alerts to potential adverse events across the range of available treatment strategies.

Multi-source and multi-modal large language model synthesis

Over the last decades, the neurosciences have expanded into increasingly segmented silos of research activity. For example, Alzheimer's disease (AD) is studied in several largely disconnected research communities. Epidemiologists studying the etiology of AD in human population strata do not regularly talk to geneticists, practicing neurologists, brain-imaging investigators, or animal experimentalists. The geneticists studying genome-wide risk variants related to AD do not necessarily cross-reference or integrate existing knowledge from these other neuroscience communities either. The imaging neuroscientists devoted to structural and functional differences in the AD brain do not necessarily take into consideration aspects of epidemiological population stratification when designing and interpreting their studies, and so on and so forth. Each AD research community operates in what appears to be its own "bubble," with its own set of notable scientists, its own pool of commonly entertained hypotheses, and its distinct process of knowledge accumulation, yielding large quantities of papers published per year.
Given the increasing amount of research output every year, a single human is increasingly unable to read all these papers. Many areas of neuroscience research activity are siloed in similar ways. Such knowledge fragmentation is perhaps one of the biggest challenges of the scientific enterprise in the 21st century. LLMs now offer an opportunity to assimilate and translate expanding knowledge from several complementary viewpoints on a single neuroscience topic.
LLMs are also starting to be tailored toward the medical domain, with promising results in tasks like medical exams and record keeping. To date, AI in medicine has often been based on computer vision tasks, with limited integration of text, voice, and other kinds of information. Summarization and integration of various data sources through LLMs thus holds tremendous promise for advancing AI assistance for practicing healthcare professionals. Biosensors, genome profiles, medical records, patient testimonials, metabolic panels, and other laboratory assays are examples of potential data sources for building a multi-modal AI framework oriented toward elucidation of patient-personalized clinical pathways.[48] The potential of such AI solutions, in terms of the direct impact they could make on the lives of patients and the performance of medical professionals, is vast and has not been fully realized yet.[49]
Currently, the use of LLMs in tools designed to help lighten the annotation workload of professionals is also a subject of interest in the realm of medicine. Although the ethics of using LLMs in medicine and medical research are beginning to be discussed,[50] it is now becoming apparent that LLMs could be effective as adjuncts to processes that currently occupy a large amount of human time and effort, such as electronic health record creation and processing, as well as many other activities such as diagnosis and prognosis of disease.
As a next holy grail, what non-text data modalities can be made LLM actionable? Broadly, LLMs may very well be the first technology that can seamlessly combine structured and unstructured information, both dynamically and at scale. Moreover, ChatGPT and similar LLM variants have successfully aggregated disparate text sources from several languages, geographies, and cultures into a single model instance.
LLMs hold promise in bridging the gap between disparate kinds of information, most obviously perhaps computer vision (i.e., images) and language (i.e., text). As a recent example from the machine learning community, Alayrac et al.[35] demonstrated that including such additional modalities can improve language modeling. Flamingo models were trained on large-scale multimodal corpora, drawn from the internet, containing naturally contextualized text and image information. The ensuing few-shot learning capabilities can be adapted to a diversity of tasks involving image and video material. Subsequently, the models can be queried to enrich other image material by generating free-form content or answering predefined multiple-choice questions. Prompting a Flamingo model with task-specific examples can provide practical benefits in many settings, based on visually conditioned autoregressive text generation. As an early neuroscience example of reading out images from a subject's mind, one study used a Bayesian model to reconstruct natural images from brain activity measurements alone.[51]
Further, DALL-E/CLIP (made available in 2021/22 by OpenAI) was an early example of next-generation text-image fusion in generative AI, initially using a GPT-3 variant under the hood and aiming at ever more realistic images generated from user prompts. This multi-modal fusion engine can synthesize various forms and styles, such as realistic natural images, painting-like art and symbols, and internal models of design schemes, invoking real and imagined objects, scenes, and people, without close training examples (zero-shot learning). Its component CLIP (contrastive language-image pre-training) was trained on ∼400 million pairs of images and text captions from the internet. This model is used to subselect among images generated by DALL-E for optimal output generation. CLIP combines computer vision and NLP within a single network to deeply process, categorize, and generate text annotations for a wide array of images. Without a strict requirement for task-specific training, it can generalize beyond its specific training information toward new, never-encountered tasks (cf. transfer learning above). In the neuroscience context, several forms of "images" could potentially be ingested in future LLM frameworks, such as structural and functional MRI brain imaging, PET, and fNIRS, and more broadly also EEG/MEG-derived brain images.
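As a concrete reference point for this kind of text-image fusion, the sketch below scores one image against several candidate captions with a publicly released CLIP checkpoint via Hugging Face transformers; the image path and captions are placeholders.

```python
# Zero-shot image-text matching with a publicly released CLIP checkpoint: an image and
# several candidate captions are embedded into the same space and scored. The image file
# and captions are placeholders.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("example_scene.jpg")   # any local image file
captions = ["a brain scan", "a city street at night", "a dog playing in a park"]

inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

probs = outputs.logits_per_image.softmax(dim=-1)   # how well the image matches each caption
for caption, p in zip(captions, probs[0].tolist()):
    print(f"{caption}: {p:.2f}")
```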
Hence, an important future research avenue will explore to what extent DALL-E/CLIP and similar emerging technologies can successfully extend from natural images to different modalities of brain "images." For example, the NeuroSynth database presents a bottom-up approach[52] that has automatically extracted the activation coordinates in 3D image space for >3,000 published articles of brain-imaging task experiments, together with the full text of those articles. As such, this initiative has completed the effort of assembling a corpus of image-description pairs that has already provided value to the neuroscience community through a web interface for user queries. In a parallel research stream, the BrainMap database[53,54] has devised a human-made ontology of mental categories at play in brain-imaging experiments in a top-down fashion. The description systems for cognitive phenomena were hand-designed by human domain experts. Here too, an existing effort has already aggregated image-description pairs that may serve as an attractive beachhead for training or refining state-of-the-art multi-modal LLMs. One idea would be to merge NeuroSynth and BrainMap based on the studies that are available in both databases, with expert definitions and full-text annotations complementing each other to enable LLM-empowered queries and perhaps reasoning across both kinds of brain-image meta-information. More broadly, such avenues aimed at transcending content types are especially promising because LLMs offer an unprecedented opportunity to fuse structured and unstructured information in a unified framework.
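A rough sketch of that merging idea, assuming hypothetical CSV exports and column names (the real NeuroSynth and BrainMap distributions use their own schemas), could look like the pandas join below.

```python
# Sketch of joining study-level records from the two databases on a shared identifier
# (e.g., PubMed ID), pairing activation coordinates with both free-text terms and expert
# ontology labels. File names and column names are hypothetical; the real NeuroSynth and
# BrainMap exports use their own schemas.
import pandas as pd

neurosynth = pd.read_csv("neurosynth_studies.csv")  # assumed columns: pmid, x, y, z, abstract_terms
brainmap = pd.read_csv("brainmap_studies.csv")      # assumed columns: pmid, behavioral_domain

merged = neurosynth.merge(brainmap, on="pmid", how="inner")  # studies present in both databases

# Each row now pairs coordinates with bottom-up text and top-down expert labels, a starting
# corpus for training or fine-tuning a multi-modal LLM.
print(merged[["pmid", "x", "y", "z", "abstract_terms", "behavioral_domain"]].head())
```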
Over the next few years, neuroscientists can systematically examine what kinds of brain-relevant information lend themselves to emerging modes of LLM integration. What kinds of neuroscience information can be tokenized, and how? Recent LLM research has shown promise for leveraging embeddings of amino acid blocks, genes and their mRNA transcripts, cells and cell types, phenotypes, and disease states. Potentially expanding their applicability in biology and medicine, LLMs may also be able to process token-transformed instances of brain-region activity, white-matter fiber pathway involvement, locations of brain structure change, frequency-band changes in EEG/MEG, or calcium imaging. In so doing, neuroscientists may bring together sequence semantics across datasets and biological perspectives, forming unified views of the brain. This agenda may demand model architecture innovations to represent these information layers. Alternatively, we can use the outputs of generously pre-trained LLMs as a form of distillation, encoding specific information modes for integration in subsequently trained smaller models toward an ultimate research goal. Concretely, the UK Biobank and other mega-datasets allow an LLM to associate genomic variant information and other molecular data with a variety of human health information. As a core aspiration for the intensely interdisciplinary neuroscience endeavor, LLMs can help us bridge the divide between disparate neuroscience communities and enable NLP models that amalgamate the world's knowledge mosaics.

Epistemological avenues to overcoming the current concept crisis

LLMs may provide an alternative toolkit that turns out to be valuable for auditing and editing the human-conceived notions that neuroscience investigators act on to understand the brain. It is important to appreciate that, especially in classical hypothesis-driven research, the entire research endeavor hinges on the pre-assumed validity of the cognitive and neural terms used to articulate the experimental research conditions. Yet, many cognitive or psychological terms in frequent usage have brittle definitions and cannot be directly observed in nature. Many human-expert-determined concepts in neuroscience may not denote "natural kinds" in that they do not carve out discrete neural circuits in nature. An overwhelming majority of concepts of cognitive processes were coined well before neuroscience emerged as a coherent discipline (around the middle of the 20th century), before brain function began to be understood. Further, certain behaviors or cognitive concepts may emerge only in carefully designed experiments in healthy subjects or in clinical conditions such as patients with localized brain lesions.[55] According to this view, neurocognitive processes can be decomposed during subject engagement in specific experimental tasks, as an avenue to revealing the mapping between brain and behavior. Perhaps it is time to put the validity of these concepts to the test with a disciplined, data-driven approach.
The intricacies neuroscientists face when articulating observations of phenomena in the brain appear closely related to Ludwig Wittgenstein's second major book, Philosophical Investigations (1953/2001). The later Wittgenstein argued that confusion introduced by human language itself is the origin of most philosophical problems. For example, in psychology, there is still no widely accepted definition of even simple words like "cognitive" and "emotional."[56,57]
Further, the brain network consistently recruited during theory-of-mind cognition, taking another individual's point of view, is also consistently involved in an array of diverse psychological processes, including moral thinking, autobiographical memory retrieval, and spatial navigation.[58,59,60]
Our legacy catalog of neurocognitive frameworks may not go in the right direction.[61]
For example, why do we implicitly expect that terms and notions of William James' opus magnum (Principles of Psychology, 1890) denote unique brain mechanisms? Further, when we encounter challenging-to-reconcile findings, we sometimes have the tendency to make up a new term instead of really getting at the core of the problem. Many neuroscience investigations take the outside-in approach[61]: they make up concepts first and then, only as a second step, go about locating or characterizing them in measures from the brain. This closely relates to what some authors have called "neo-phrenology": a reductionist tendency toward "overlocalization," attempting to map terms onto local geographies of the brain.[62]
While modern neuroimaging has shown that specific brain areas are indeed more active during certain tasks, the brain is highly interconnected, and many cognitive functions are distributed across networks. Thus, pinpointing a single “spot” for a complex function can be misleading.
The research focus should perhaps be placed on the actual responses of the brain, not the (reification of) human-invented terms themselves. Indeed, it is neurocognitive processes in the brain that give rise to behavior and cognition. In short, it remains elusive how and to what extent psychological terms map onto regional brain responses, and vice versa.[62,63,64]
For these reasons, some authors put forward that neuroscience is increasingly rich in data but remains poor in theory,[65] pointing to the acute need for new means of generating research hypotheses.
An analogous point can be made about definitions of brain disease, especially terms in psychiatry. The same notion is not uniquely related to a single mechanism, and the same mechanism does not often isolate a distinct diagnostic entity. This realization may be part of the reason why an identical drug class often helps alleviate symptoms for nominally separate psychiatric conditions. The DSM-5 and ICD-10 manuals catalog psychiatric diseases based on the judgment of selected experts. Moreover, funding agencies entrust scientists with funding commitments only if their research proposals' rationale and expected outcomes are firmly rooted in these human-made diagnostic categories. It has, however, become increasingly clear that pathophysiological processes in primary biology are exceedingly heterogeneous and mutually overlapping in degrees, even at the raw genetic level.[66]
Consequently, today’s description systems for mental health conditions help communication between practicing medical doctors but lack biological validity in research and predictability in clinical care.
Despite these evident shortcomings of incumbent description systems in neuroscience, there have been few attempts to build such a system of semantic notions in a bottom-up fashion. In a seminal study (Figure 8),[67] a data-led approach was devised to design a framework for neurocognitive categories by pooling across information from ∼20,000 brain-imaging papers in humans. Capitalizing on the accumulated data trove of >25 years of brain-imaging research, NLP algorithms mined the semantic content of the research articles, which was interfaced with >600,000 topographic locations from functional brain scans (fMRI, PET). Paying equal and simultaneous attention to both semantic principles and neural activity principles allowed a systematic integration of brain and behavior in a holistic approach. Among other benefits, this approach helps overcome the dilemma between forward inference (concept-to-brain reasoning) and reverse inference (brain-to-concept reasoning) that haunts the neuroscience enterprise.[62]
In empirical validation analyses, such a "computational ontology" was demonstrated to better reproduce the term-function links in new, unseen research articles than widely embraced description systems in neuroscience and psychiatry.
Figure 8 NLP tools to organize the existing knowledge of human cognition in a fully bottom-up fashion
Taken together, the narratives and stories that we use to describe the world shape the way we design our neuroscience experiments and interpret what we find. In neuroscience, true progress requires particular sensitivity to word usage, language hygiene, and variants in conceptualization. In a future of LLM-empowered neuroscience, we may be able to reformat the enshrined terminologies of psychological terms toward semantic frameworks of evidence-based mental categories, rather than perpetuating reified legacy terms from a previous historical era. Emerging LLM technologies can spark advances toward a biologically grounded redefinition of major brain disease nosology, cutting across diagnostic boundaries in a new era of evidence-based psychiatry, rather than relying on the judgment of selected experts alone. As Wittgenstein said, "The limits of my language mean the limits of my world."[68]

Conclusions

Biology has become "computable" over the last 5–10 years, for instance in the form of massive genetic databases combined with targeted CRISPR gene editing and machine-learning analytics, bringing us a step closer to an engineering discipline. Our demonstrated ability to generate biomolecular data troves eclipses our realized ambition to actually glean understanding from these systems: neuroscientists today are "drowning in information but starving for knowledge," as John Naisbitt wrote.[69]
LLMs bring a new set of opportunities to the game (but see Box 1). This model class shows that sheer statistical brute force can assist in demystifying the brain and disease by reading and generating biology, by crafting knowledge frameworks, and by unlocking never-before-accessible modes of information integration and interrogation at scale. Foundation models will probably serve to extract, synergize, and synthesize knowledge from and across siloed neuroscience domains ("bubbles"), a task that may or may not exceed human comprehension. Neuroscientists will need to open up to and embrace the uncomfortable possibility that the human brain is a biological system that goes beyond what human intelligence alone can fully grasp without the assistance of AI tools applied to big data.
BOX 1
Limitations of current LLM tools
Despite LLMs being perhaps the most rapidly evolving technology of all time, a number of challenges remain for today's incarnation of these models.
Hallucinations: Refers to the common issue where the model generates text or information that is not anchored in reality or the provided context. The model might generate plausible-sounding but incorrect or fabricated information, despite a deceptively confident tone. By design, an LLM generates text whether or not the model is certain of its output. Hence, current LLM variants may be inherently less well positioned for accurate and reliable information queries (e.g., giving exact paper references).[70]
Dependency on big data: LLMs have a hunger for vast quantities of input data. Large fractions of the internet have now been exploited for LLM development. Consequently, one might wonder whether we have already saturated our available training data. What are modes of future data generation for training ever-more powerful LLMs? One possibility is that last-generation LLMs will increasingly generate output data, on the internet or other venues, which will be fed back into next-generation LLMs. It is currently hard to anticipate what the ramifications of such a recursion scenario would be. As one possible consequence, solutions to performance benchmarks may increasingly contaminate the training data.
Resource hunger: Deploying LLMs requires a significant amount of computational power, information storage capacity, and energy consumption, probably also with a lasting environmental footprint.[71] For the goal of training LLMs from scratch, the necessary richness in compute and storage resources probably sidelines the large majority of institutions in industry, academia, and government on the planet.
Reasoning: Instances of this model class often lack common sense or the ability to understand and respond to novel situations that were not present in their training data. How do we make sure that LLMs act in line with human values (the so-called alignment problem)? Also, at times, these models may generate text that is not relevant or not fully aligned with the context provided to them. Part of the explanation is that LLMs perform fairly well on single-step reasoning tasks but face challenges in the sequential integration of consecutive reasoning steps.
Biases and other ethical considerations: Further, LLMs inherit the biases that may be present in the ingested datasets during training. The models can inadvertently generate harmful, offensive, or otherwise skewed outputs.[72] Reinforcement learning from human feedback, calibrating LLMs toward the kinds of answers that humans expect, may be part of the solution. Further, current LLMs do not necessarily work well across languages and cultures.[73]
Certification: Assigning watermarks, or deciding whether a text was LLM-generated or not, is probably challenging to impossible.
Lack of explainability: For users and developers alike, it remains difficult to understand why a given model generated a particular response, which is a significant limitation for applications that require interpretability and transparency, especially given increasing political pressure for white-box machine learning solutions (cf. GDPR law in the European Union). Closed-source LLMs further complicate this matter.
Diminishing returns in scaling: Continuing to increase the data quantity and compute/storage resources is already starting to hit regimes with diminishing returns. Alternative strategies for bringing the emergent abilities of LLMs to the next level will probably be required in the future.
From a broader societal perspective, the industrial revolution touched mostly blue-collar jobs. In contrast, the current LLM revolution will perhaps mostly touch white-collar jobs, including those of research workers in the neurosciences. Indeed, the unreasonable effectiveness of LLMs has been compared by venture capitalists and investors to the invention of fire as a tool, electricity, or the internet.

Acknowledgments

D.B. was supported by the Brain Canada Foundation, through the Canada Brain Research Fund, with the financial support of Health Canada, National Institutes of Health (NIH R01 AG068563A, NIH R01 DA053301-01A1, NIH R01 MH129858-01A1), the Canadian Institute of Health Research (CIHR 438531, CIHR 470425), the Healthy Brains Healthy Lives initiative (Canada First Research Excellence fund), Google (Research Award, Teaching Award), and by the CIFAR Artificial Intelligence Chairs program (Canada Institute for Advanced Research).

Declaration of interests

Four co-authors are employees at MindState Design Labs (A.T., O.L., P.W., and T.R.) and five are equity holders (D.B., A.T., O.L., P.W., and T.R.).

References

Mikolov, T. ∙ Sutskever, I. ∙ Chen, K. ...
Distributed representations of words and phrases and their compositionality
Adv. Neural Inf. Process. Syst. 2013; 26
Le, Q. ∙ Mikolov, T.
Distributed representations of sentences and documents
PMLR. 2014; 32:1188-1196
Conneau, A. ∙ Kiela, D. ∙ Schwenk, H. ...
Supervised learning of universal sentence representations from natural language inference data
Preprint at arXiv. 2017
McCann, B. ∙ Bradbury, J. ∙ Xiong, C. ...
Learned in translation: Contextualized word vectors
Adv. Neural Inf. Process. Syst. 2017;
Mikolov, T. ∙ Chen, K. ∙ Corrado, G. ...
Efficient estimation of word representations in vector space
Preprint at arXiv. 2013
Pennington, J. ∙ Socher, R. ∙ Manning, C.D.
Glove: Global vectors for word representation
Bubeck, S. ∙ Chandrasekaran, V. ∙ Eldan, R. ...
Sparks of artificial general intelligence: Early experiments with gpt-4
Preprint at arXiv. 2023
Goldstein, A. ∙ Zada, Z. ∙ Buchnik, E. ...
Shared computational principles for language processing in humans and deep language models
Nat. Neurosci. 2022; 25:369-380
Caucheteux, C. ∙ Gramfort, A. ∙ King, J.-R.
Evidence of a predictive coding hierarchy in the human brain listening to speech
Nat. Hum. Behav. 2023; 7:430-441
Schrimpf, M. ∙ Blank, I.A. ∙ Tuckute, G. ...
The neural architecture of language: Integrative modeling converges on predictive processing
Proc. Natl. Acad. Sci. USA. 2021; 118, e2105646118
Vaswani, A. ∙ Shazeer, N. ∙ Parmar, N. ...
Attention is all you need
Adv. Neural Inf. Process. Syst. 2017; 30
Hassid, M. ∙ Peng, H. ∙ Rotem, D. ...
How much does attention actually attend? Questioning the Importance of Attention in Pretrained Transformers
Preprint at arXiv. 2022
Tay, Y. ∙ Dehghani, M. ∙ Abnar, S. ...
Long range arena: A benchmark for efficient transformers
Preprint at arXiv. 2020
Bzdok, D. ∙ Yeo, B.T.T.
Inference in the age of big data: Future perspectives on neuroscience
Neuroimage. 2017; 155:549-564
Wei, J. ∙ Tay, Y. ∙ Bommasani, R. ...
Emergent abilities of large language models
Preprint at arXiv. 2022
OpenAI
GPT-4 Technical Report
Preprint at arXiv. 2023
Kaplan, J. ∙ McCandlish, S. ∙ Henighan, T. ...
Scaling laws for neural language models
Preprint at arXiv. 2020
Touvron, H. ∙ Lavril, T. ∙ Izacard, G. ...
Llama: Open and efficient foundation language models
Preprint at arXiv. 2023
Hoffmann, J. ∙ Borgeaud, S. ∙ Mensch, A. ...
Training compute-optimal large language models
Preprint at arXiv. 2022
Schaeffer, R. ∙ Miranda, B. ∙ Koyejo, S.
Are emergent abilities of Large Language Models a mirage?
Preprint at arXiv. 2023
Caballero, E. ∙ Gupta, K. ∙ Rish, I. ...
Broken neural scaling laws
Preprint at arXiv. 2022
Houlsby, N. ∙ Giurgiu, A. ∙ Jastrzebski, S. ...
Parameter-efficient transfer learning for NLP
PMLR. 2019; 97:2790-2799
Pfeiffer, J. ∙ Rücklé, A. ∙ Poth, C. ...
Adapterhub: A framework for adapting transformers
Preprint at arXiv. 2020
Bapna, A. ∙ Arivazhagan, N. ∙ Firat, O.
Simple, scalable adaptation for neural machine translation
Preprint at arXiv. 2019
Radford, A. ∙ Wu, J. ∙ Child, R. ...
Language models are unsupervised multitask learners
OpenAI blog. 2019; 1:9
Brown, T. ∙ Mann, B. ∙ Ryder, N. ...
Language models are few-shot learners
Adv. Neural Inf. Process. Syst. 2020; 33:1877-1901
Xiang, J. ∙ Tao, T. ∙ Gu, Y. ...
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Preprint at arXiv. 2023
Berglund, L. ∙ Tong, M. ∙ Kaufmann, M. ...
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A".
Preprint at arXiv. 2023
Brandes, N. ∙ Goldman, G. ∙ Wang, C.H. ...
Genome-wide prediction of disease variant effects with a deep protein language model
Nat. Genet. 2023; 55:1512-1522
Cui, H. ∙ Wang, C. ∙ Maan, H. ...
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
Preprint at bioRxiv. 2023
Jumper, J. ∙ Evans, R. ∙ Pritzel, A. ...
Highly accurate protein structure prediction with AlphaFold
Nature. 2021; 596:583-589
Rives, A. ∙ Meier, J. ∙ Sercu, T. ...
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences
Proc. Natl. Acad. Sci. USA. 2021; 118, e2016239118
Yang, E. ∙ Milisav, F. ∙ Kopal, J. ...
The default network dominates neural responses to evolving movie stories
Nat. Commun. 2023; 14:4197
Ye, Z. ∙ Liu, Y. ∙ Li, Q.
Recent Progress in Smart Electronic Nose Technologies Enabled with Machine Learning Methods
Sensors. 2021; 21, 7620
Alayrac, J.-B. ∙ Donahue, J. ∙ Luc, P. ...
Flamingo: a visual language model for few-shot learning
Adv. Neural Inf. Process. Syst. 2022; 35:23716-23736
Sharma, P. ∙ Ding, N. ∙ Goodman, S. ...
Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018;
Thomee, B. ∙ Shamma, D.A. ∙ Friedland, G. ...
YFCC100M: The new data in multimedia research
Commun. ACM. 2016; 59:64-73
Zhou, Y. ∙ Chia, M.A. ∙ Wagner, S.K. ...
A foundation model for generalizable disease detection from retinal images
Nature. 2023; 622:156-163
Wagner, S.K. ∙ Hughes, F. ∙ Cortina-Borja, M. ...
AlzEye: longitudinal record-level linkage of ophthalmic imaging and hospital admissions of 353 157 patients in London, UK
BMJ open. 2022; 12:e058552
Weininger, D.
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules
J. Chem. Inf. Comput. Sci. 1988; 28:31-36
Bzdok, D. ∙ Ioannidis, J.P.
Exploration, inference, and prediction in neuroscience and biomedicine
Trends in neurosciences. 2019; 42:251-262
Bzdok, D. ∙ Engemann, D. ∙ Thirion, B.
Inference and prediction diverge in biomedicine
Patterns. 2020; 1:100119
Shanahan, M. ∙ McDonell, K. ∙ Reynolds, L.
Role play with large language models
Nature. 2023; 623:493-498
Sharma, A. ∙ Kumar, R. ∙ Ranjta, S. ...
SMILES to smell: decoding the structure–odor relationship of chemical compounds using the deep neural network approach
J. Chem. Inf. Model. 2021; 61:676-688
Ballentine, G. ∙ Friedman, S.F. ∙ Bzdok, D.
Trips and neurotransmitters: Discovering principled patterns across 6850 hallucinogenic experiences
Sci. Adv. 2022; 8, eabl6989
Wu, C. ∙ Zhang, X. ∙ Zhang, Y. ...
Pmc-llama: Further finetuning llama on medical papers
Preprint at arXiv. 2023
Rodziewicz, T.L. ∙ Houseman, B. ∙ Hipskind, J.E.
Medical Error Reduction and Prevention
StatPearls
StatPearls Publishing LLC., 2023
Hipp, R. ∙ Abel, E. ∙ Weber, R.J.
A Primer on Clinical Pathways
Hosp. Pharm. 2016; 51:416-421
Acosta, J.N. ∙ Falcone, G.J. ∙ Rajpurkar, P. ...
Multimodal biomedical AI
Nat. Med. 2022; 28:1773-1784
Harrer, S.
Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine
EBioMedicine. 2023; 90, 104512
Naselaris, T. ∙ Prenger, R.J. ∙ Kay, K.N. ...
Bayesian reconstruction of natural images from human brain activity
Neuron. 2009; 63:902-915
Yarkoni, T. ∙ Poldrack, R.A. ∙ Nichols, T.E. ...
Large-scale automated synthesis of human functional neuroimaging data
Nat. Methods. 2011; 8:665-670
Laird, A.R. ∙ Lancaster, J.L. ∙ Fox, P.T.
BrainMap: the social evolution of a human brain mapping database
Neuroinformatics. 2005; 3:65-78
Fox, P.T. ∙ Lancaster, J.L.
Opinion: Mapping context and content: the BrainMap model
Nat. Rev. Neurosci. 2002; 3:319-321
Krakauer, J.W. ∙ Ghazanfar, A.A. ∙ Gomez-Marin, A. ...
Neuroscience Needs Behavior: Correcting a Reductionist Bias
Neuron. 2017; 93:480-490
Pessoa, L.
On the relationship between emotion and cognition
Nat. Rev. Neurosci. 2008; 9:148-158
Van Overwalle, F.
A dissociation between social mentalizing and general reasoning
Neuroimage. 2011; 54:1589-1599
Bzdok, D. ∙ Schilbach, L. ∙ Vogeley, K. ...
Parsing the neural correlates of moral cognition: ALE meta-analysis on morality, theory of mind, and empathy
Brain Struct. Funct. 2012; 217:783-796
Dohmatob, E. ∙ Dumas, G. ∙ Bzdok, D.
Dark control: The default mode network as a reinforcement learning agent
Hum. Brain Mapp. 2020; 41:3318-3341
Spreng, R.N. ∙ Mar, R.A. ∙ Kim, A.S.N.
The common neural basis of autobiographical memory, prospection, navigation, theory of mind, and the default mode: a quantitative meta-analysis
J. Cogn. Neurosci. 2009; 21:489-510
György Buzsáki, M.
The brain from inside out
Oxford University Press, 2019
Poldrack, R.A.
Can cognitive processes be inferred from neuroimaging data?
Trends Cogn. Sci. 2006; 10:59-63
Laird, A.R. ∙ Fox, P.M. ∙ Eickhoff, S.B. ...
Behavioral interpretations of intrinsic connectivity networks
J. Cogn. Neurosci. 2011; 23:4022-4037
Mesulam, M.M.
From sensation to cognition
Brain. 1998; 121:1013-1052
Voytek, B.
The data science future of neuroscience theory
Nat. Methods. 2022; 19:1349-1350
Brainstorm Consortium ∙ Anttila, V. ∙ Bulik-Sullivan, B. ...
Analysis of shared heritability in common disorders of the brain
Science. 2018; 360, eaap8757
Beam, E. ∙ Potts, C. ∙ Poldrack, R.A. ...
A data-driven framework for mapping domains of human neurobiology
Nat. Neurosci. 2021; 24:1733-1744
Wittgenstein, L.
Philosophical Investigations
Basil Blackwell, 1958
Naisbitt, J.
Megatrends: ten new directions transforming our lives
Warner Books, 1988
Dziri, N. ∙ Milton, S. ∙ Yu, M. ...
On the origin of hallucinations in conversational models: Is it the datasets or the models?
Preprint at arXiv. 2022
Strubell, E. ∙ Ganesh, A. ∙ McCallum, A.
Energy and policy considerations for deep learning in NLP
Preprint at arXiv. 2019
Nadeem, M. ∙ Bethke, A. ∙ Reddy, S.
StereoSet: Measuring stereotypical bias in pretrained language models
Preprint at arXiv. 2020
Liu, F. ∙ Bugliarello, E. ∙ Ponti, E.M. ...
Visually grounded reasoning across languages and cultures
Preprint at arXiv. 2021
