WildPointer

交叉验证的原理及实现

Posted on 2018-07-13 Edited on 2026-04-04 In Machine Learning Disqus:

最近在写一个层次分类模型，为了更好地选择模型，用到了交叉验证，于是详细了解了一下。

Using Contextual Speller Techniques and Language Modeling for ESL Error Correction | Notes

Posted on 2018-04-25 Edited on 2026-04-04 In NLP Disqus:

Some notes on paper Using Contextual Speller Techniques and Language Modeling for ESL Error Correction.

使用上下文敏感的拼写检查技术和语言建模进行 ESL (English as a Second Language) 语法纠错。

Machine Learning Crash Course with Tensorflow APIs | Notes

Posted on 2018-04-02 Edited on 2026-04-05 In Machine Learning Disqus:

After work, I studied the “Machine Learning Crash Course” produced by Google and made some notes.

在工作之余学习了 Google 出品的 “机器学习速成课程”，做了些笔记。

2018 Personal Challenges

Posted on 2018-02-24 Edited on 2026-04-04 In Diary Disqus:

My personal challenges for 2018.

2018年的个人挑战。

LDA 数学笔记

Posted on 2017-12-16 Edited on 2026-04-04 In NLP Disqus:

LDA 主题模型几乎是每一个 NLP 工程师的必修课，而她背后的数学与概率论知识却让她看起来有些高冷。

那么何为 LDA (Latent Dirichlet Allocation)？

简单的说，L (Latent 隐含)，主题隐含在文档中；DA (Dirichlet Allocation 狄利克雷分布)，文档的主题服从 Dirichlet Distribution。

本文将站在一个初学者的角度来讲述 LDA 与她背后的故事。

Ngrams 语言模型与拼写校正

Posted on 2017-12-15 Edited on 2026-04-04 In NLP Disqus:

在上一篇文章中，我翻译了 Peter Norvig 的 How to Write a Spelling Corrector，其中的拼写校正器主要依赖编辑距离和词频，并不利用上下文信息，因此在 real-word error 这类问题上效果有限。本文继续沿着这个话题往前走，介绍 NLP 中最经典的一类语言模型：Ngrams 模型，以及如何把它接入一个基于编辑距离的拼写校正器中，使校正器具备感知上下文 (context-sensitive) 的能力。

如何做一个拼写校正器

Posted on 2017-12-13 Edited on 2026-04-04 In NLP Disqus:

本文翻译自 Peter Norvig 的 How to Write a Spelling Corrector

一个托福菜鸟的自我修养

Posted on 2017-10-17 Edited on 2026-04-04 In English Disqus:

Some thoughts on how to get 100+ in TOEFL.

六战托福终破百，直挂云帆济沧海。

一起作业 NLP 工程师电面

Posted on 2017-08-04 Edited on 2026-04-04 In Interview Disqus:

I took an interview with 17zuoye today. This article records the process and content of the interview.

今天参加了一起作业的 NLP 工程师面试，本文记录了面试的内容。

今日头条算法实习生面试

Posted on 2017-07-27 Edited on 2026-04-04 In Interview Disqus:

I took an interview with TouTiao.com today. This article records the process and content of the interview.

今天参加了今日头条的算法实习生面试，本文记录了面试的内容。

0%