Yu of Daphne

春秋笔法·丹枫嫩寒


  • Home

  • About

  • Tags28

  • Categories7

  • Timeline

Best practices to use Apache Spark

Posted on 2019-12-01 | In System Design | Comments:

Learning notes from DataBricks talks Optimizing File Loading And Partition DiscoveryData loading is the first step of spark application, when dataset ...

Read more »

Starting a new journey

Posted on 2019-07-01 | Edited on 2019-09-20 | In Life | Comments:

June 28th, marked the end of my journey at SurveyMonkey, a great company I had worked for more than 3 years. It’s a bittersweet heart to say goodbye. ...

Read more »

写给Daphne的诗

Posted on 2019-05-18 | Edited on 2019-09-20 | In Life | Comments:

第一章: 萌芽 你要问我,我们的故事从哪儿开始, 走出考场的那一刻,我以为将是故事的结局 而微信上的只言片语,难道只是我一如既往的淡定? 也许大家都羡慕一见钟情, 可比一见钟情更浪漫的,是一聊倾心 第二章:启 城 即便我有一双翅膀,我也会将它折断 因为唾手可得的,到头来也可能只是冷面 而纵览八百里路 ...

Read more »

Fun topics in distributed system

Posted on 2019-04-26 | Edited on 2019-09-20 | In System Design | Comments:

During the first days of learning distributed system design, we heard a lot buzzwords and technologies, and we are busy with learning one after one. ...

Read more »

Hidden Companies (Toronto)

Posted on 2019-04-20 | Edited on 2019-09-20 | In Careers | Comments:

There are a lot job websites we use to seek a job, like LinkedIn, GlassDoor, Indeed, Monster. But there is still a ton of jobs outside those popular s ...

Read more »

NLP in big companies

Posted on 2019-04-19 | Edited on 2019-09-20 | In System Design | Comments:

In this blog post, I am trying to find some good examples of building NLP applications in reality. A good starter point is to find out how some other ...

Read more »

Natural Language Processing 101

Posted on 2019-04-17 | Edited on 2019-09-20 | In Data Science | Comments:

This is a very simple and naive introductory to summary the knowledge in natural language processing, based on my self learning. What is Natural Langu ...

Read more »

Searching with bloom filter

Posted on 2018-10-16 | Edited on 2019-09-20 | In Software Development | Comments:

Problem statementOur platform is sending 4 million emails per day, and many of them contains a lot user generated content which has potential risk of ...

Read more »

Compare streaming frameworks

Posted on 2018-10-16 | Edited on 2019-12-02 | In System Design | Comments:

The first streaming framework I got to know is Apache Spark, my team owns a small spark cluster which has 1 leader and 4 followers(It is said that mas ...

Read more »

Notes on data science self learning

Posted on 2018-08-15 | Edited on 2019-09-20 | In Data Science | Comments:

Tons of resources online will get you distracted a lot, a good way is to have your own learning path and keep focus. I got this idea from two people: ...

Read more »
1234
Yu Qian

Yu Qian

Do something matters
40 posts
7 categories
28 tags
GitHub CSDN
© 2021 Yu Qian
Powered by Hexo v3.8.0
|
Theme – NexT.Gemini v7.1.0
|