估计阅读时长: 13 分钟EC Number是国际酶学委员会(IUBMB)制定的一套酶分类编号体系,EC Number采用层级分类法,由4个数字组成,分别代表酶的大类、亚类、亚亚类和序号。例如,“EC 1.1.1.37”中,第一个“1”表示氧化还原酶大类;第二个“1”表示作用于CH-OH基团;第三个“1”表示以NAD+或NADP+为受体的酶;第四个“37”表示特定酶苹果酸脱氢酶。这种层次结构意味着EC编号蕴含了丰富的功能信息,包括酶催化的反应类型和底物/机制。将EC Number嵌入为向量,有助于我们利用机器学习模型进行功能预测、相似性分析等。 Order by Date Name Attachments Capture • 14 kB • 0 […]
估计阅读时长: 11 分钟在将生物序列(如基因组或蛋白质序列)或文本数据转换为数值向量形式时,TF-IDF(Term Frequency-Inverse Document Frequency)和N-gram One-hot(又称Bag-of-n-grams)是两种经典且基础的文档嵌入算法。它们各自侧重于不同的特征提取方式,常被用于自然语言处理和生物信息学领域。 Attachments scatter_plot • 433 kB • 3 click 2026年2月10日
Recent Posts
Archives
- February 2026 (2)
- January 2026 (2)
- December 2025 (10)
- November 2025 (2)
- October 2025 (1)
- August 2025 (3)
- July 2025 (2)
- June 2025 (6)
- May 2025 (3)
- November 2023 (1)
- June 2023 (2)
- May 2023 (2)
- April 2023 (2)
- March 2023 (2)
- February 2023 (1)
- August 2022 (2)
- July 2022 (2)
- June 2022 (5)
- May 2022 (5)
- April 2022 (4)
- March 2022 (3)
- January 2022 (2)
- December 2021 (2)
- November 2021 (2)
- October 2021 (6)
- September 2021 (8)
- August 2021 (8)
- July 2021 (6)
- June 2021 (20)
- May 2021 (10)
Tags
algorithm (33)
bilibili (3)
binary tree (3)
clustering (19)
contour (3)
Darwinism (4)
dataframe (3)
data visualization (23)
dotnet-core (25)
GCModeller (20)
gdi+ (23)
gem (7)
ggplot (14)
graph (14)
heatmap (5)
http (4)
image processing (7)
kegg (8)
kmeans (3)
language (7)
linq (3)
linux (8)
machine learning (4)
mass spectrometry (12)
math (19)
metagenomics (5)
motif (4)
MSI (4)
mzkit (19)
network (8)
pathway (4)
pipeline (4)
query (5)
R# (44)
rsharp (23)
scripting (14)
single-cell (6)
sql (3)
symbolic computation (3)
text processing (4)
typescript (3)
ubuntu (4)
uniprot (3)
vb (19)
VisualBasic (50)

I'm fine, thank you. and you?
起了个头而已,等后续更新🤣
Marvelous, what a weblog it is! This web site provides helpful facts to us, keep it up.
过来围观大佬的文章
哈哈,文章写到一半烂尾啦