Three solutions to codejam2015-round2-pbC

Fri, 27 May 2016 Category tech algorithm codejam

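I worked on this codejam problem yesterday. Each of its three solutions is quite representative, so I am writing them down here.

The problem is here: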
https://code.google.com/codejam/contest/8234486/dashboard#s=p2

Elliot's parents speak French and English to him at home. He has heard a lot of words, but it isn't always clear to him which word comes from which language! Elliot knows one sentence ...

Using weka for machine learning in a Java program

Sat, 21 May 2016 Category soft ml weka java

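Until now I had always used weka's GUI for machine learning tasks. The interface is ugly, but the software really is a great tool for getting machine learning work started quickly. I will write about the GUI version of weka later when I have time; this post records the Java version of weka that I have been using recently.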

1. Include jars into project

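From the download links on the weka website, pick the Linux version of the weka archive. After downloading, find the weka.jar file and include it in your project, and it is ready to use (btw, I am now abandoning eclipse and moving to IDEA...).

The weka documentation is included in the extracted archive, and the online documentation is at: http://weka.sourceforge.net/doc.stable-3-8/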

about libsvm...

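One point about libsvm deserves special mention. The algorithms bundled with weka do not include libsvm (there is a similar one, SMO, but libsvm is the battle-tested one...), so it has to be installed with weka's package manager. The package manager is opened from the menu in weka's main window:

Search for libsvm in the package manager and install it. Afterwards (on Linux) there is a wekafiles folder in the home directory, and the libsvm content is under wekafiles/packages/LibSVM/.

Note that using libsvm requires referencing two jar files at the same time, and both are named libsvm.jar!!

These two jars ...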

[Algorithms II] Week 6-3 Intractability

Tue, 23 Feb 2016 Category notes algorithm Series Part 13 of «Algorithms Princeton MOOC II»

1. Introduction to Intractability

recall model of computation: DFA
a universal model of computation: Turing machine
→ no more powerful model of computation.
Turing machine can compute any function that can be computed by a physically harnessable process of the natural world.
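
As a reminder of what the simpler DFA model looks like in practice, here is a minimal sketch of a DFA simulator. The transition table, the state names and the "even number of 1s" language are hypothetical choices made for illustration; they are not taken from the course notes.

```python
def run_dfa(transitions, start, accepting, text):
    """Return True if the DFA ends in an accepting state after reading text."""
    state = start
    for symbol in text:
        # A DFA has exactly one next state per (state, symbol) pair.
        state = transitions[(state, symbol)]
    return state in accepting


# Hypothetical example: accept binary strings with an even number of '1's.
EVEN_ONES = {
    ("even", "0"): "even",
    ("even", "1"): "odd",
    ("odd", "0"): "odd",
    ("odd", "1"): "even",
}


def accepts_even_ones(text):
    return run_dfa(EVEN_ONES, "even", {"even"}, text)
```

The machine keeps only a single state and no other memory, which is what separates a DFA from a Turing machine with its unbounded tape.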

bottom line: Turing machine is a simple and universal ...

[Algorithms II] Week 6-2 Linear Programming

Sun, 21 Feb 2016 Category notes algorithm Series Part 12 of «Algorithms Princeton MOOC II»

simplex algo: top 10 algo of the 20th century (ever?).

what is linear programming:
a general problem-solving model that works for:
shortest-path, maxflow, MST, matching, assignment, ...

1. Brewer's Problem

toy example: choose products to maximize profit.
...
feasible region: a convex polygon.

⇒ optimum solution appears at an extreme point.
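
The extreme-point property can be checked by brute force on a tiny 2-variable problem. The sketch below is not the simplex algorithm and does not use the brewer's numbers; the constraints and objective are made-up values, and it assumes a bounded, non-empty feasible region. It intersects every pair of constraint lines, keeps the feasible intersections (the extreme points), and picks the best one.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical toy LP. Each constraint (a, b, c) means a*x + b*y <= c.
CONSTRAINTS = [
    (1, 1, 4),
    (1, 3, 6),
    (1, 0, 3),
    (-1, 0, 0),  # x >= 0
    (0, -1, 0),  # y >= 0
]
OBJECTIVE = (3, 2)  # maximize 3x + 2y


def intersection(c1, c2):
    """Intersect the boundary lines of two constraints (Cramer's rule)."""
    a1, b1, r1 = c1
    a2, b2, r2 = c2
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None  # parallel lines, no single intersection point
    x = Fraction(r1 * b2 - r2 * b1, det)
    y = Fraction(a1 * r2 - a2 * r1, det)
    return x, y


def is_feasible(point):
    x, y = point
    return all(a * x + b * y <= c for a, b, c in CONSTRAINTS)


def value(point):
    return OBJECTIVE[0] * point[0] + OBJECTIVE[1] * point[1]


def best_extreme_point():
    """Enumerate feasible line intersections and return the best one."""
    candidates = []
    for c1, c2 in combinations(CONSTRAINTS, 2):
        point = intersection(c1, c2)
        if point is not None and is_feasible(point):
            candidates.append(point)
    return max(candidates, key=value)
```

Enumerating all pairs only works for toy sizes; the number of extreme points grows quickly with the number of constraints, which is why an algorithm such as simplex moves from one extreme point to a neighbouring one instead.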

standard ...

[Advanced Python course] Object-oriented programming

Fri, 19 Feb 2016 Category notes python Series Part 2 of «Advanced Python course»

http://www.imooc.com/learn/317

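Modules and packages

Package: a folder (can be nested) that contains an __init__.py file (one at every level)
Module: a py file

Code is split across multiple py files (module name = file name). Variables with the same name in different modules do not affect each other.

Module name conflicts: put the modules with the same name in different packages.

Importing modules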

from math import log
from logging import log as logger

When referencing: use the full path (package + module name). ex. p1.util.f()
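
A minimal sketch of resolving a module name conflict with packages: two modules both named util live in different packages and are referenced by their full paths. The package p2, the helper make_package and the function bodies are hypothetical, and the packages are written to a temporary directory only so that the example is self-contained.

```python
import importlib
import os
import sys
import tempfile


def make_package(root, package, module_source):
    """Create root/package/ containing __init__.py and util.py."""
    package_dir = os.path.join(root, package)
    os.makedirs(package_dir)
    # The (empty) __init__.py marks the folder as a package.
    open(os.path.join(package_dir, "__init__.py"), "w").close()
    with open(os.path.join(package_dir, "util.py"), "w") as handle:
        handle.write(module_source)


root = tempfile.mkdtemp()
make_package(root, "p1", "def f():\n    return 'p1.util.f'\n")
make_package(root, "p2", "def f():\n    return 'p2.util.f'\n")
sys.path.insert(0, root)        # make the temporary packages importable
importlib.invalidate_caches()

import p1.util
import p2.util

# Same module name and same function name, but no clash:
result_p1 = p1.util.f()
result_p2 = p2.util.f()
```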

Dynamically importing modules

try:
    from cStringIO import ...

[Algorithms II] Week 6-1 Reductions

Fri, 19 Feb 2016 Category notes algorithm Series Part 11 of «Algorithms Princeton MOOC II»

Goal: classify problems according to computational requirements.
bad news: for a huge number of pbs we don't know...

1. Introduction to Reductions

shifting gears:

from individual problems to problem-solving models.
from linear/quadratic to polynomial/exponential pbs
from implementation details to conceptual framework

suppose we could (not) solve pb X ...

[Advanced Python course] Functional programming

Wed, 17 Feb 2016 Category notes python Series Part 1 of «Advanced Python course»

http://www.imooc.com/learn/317

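Functional programming: more abstract, further from instructions (the computer), closer to computation (mathematics).

No variables needed (python allows variables, so python is not purely functional)
Higher-order functions
Closures: returning functions
Anonymous functions

Higher-order functions

A variable can point to a function: f=abs; f(-10)
A function name is just a variable that points to a function: abs=len
Higher-order function: a function that takes a function as an argument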

def add(x, y, f):
    return f(x) + f(y)
add(-5, 9, abs)
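
The same add function also works with the other items from the list above: an anonymous function and a closure can both be passed as f. This is a small sketch; make_scaler and the variable names are hypothetical names introduced for illustration.

```python
def add(x, y, f):
    # f is applied to each argument before the results are summed
    return f(x) + f(y)


def make_scaler(factor):
    """Closure: returns a function that remembers factor."""
    def scale(value):
        return value * factor
    return scale


total_abs = add(-5, 9, abs)                # built-in function as argument
total_squares = add(-5, 9, lambda v: v * v)  # anonymous function as argument
total_doubled = add(-5, 9, make_scaler(2))   # closure as argument
```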

map()

map() is a built-in Python higher-order function ...

C++ STL summary & code snippets

Sat, 09 Jan 2016 Category tech C++

A summary of the code snippets I use most often from the C++ STL. (location: https://github.com/X-Wei/cpp-demo-snippets/tree/master/STL)
cpp documentation: http://en.cppreference.com/w/cpp

The most commonly used libraries are:
<algorithm>, <vector>, <queue>, <set>, <map>, <cmath>

Another common version of the opening of a cpp file is:

#include <iostream>  
#include <vector>  
#include <algorithm>  
using namespace std;  
#define forloop(i,lo,hi) for(int i = (lo); i <= (hi); i++)  
#define rep(i ...

[Algorithms II] Week 5-2 Data Compression

Mon, 04 Jan 2016 Category notes algorithm Series Part 10 of «Algorithms Princeton MOOC II»

1. Introduction to Data Compression

pb: reduce the size of a file, to save space/time for storing/transmitting.
applications: generic file compression (gzip), multimedia (mp3), communication (skype).

From binary data B, ⇒ generate a compressed representation C(B).
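
A minimal sketch of the B ⇒ C(B) round trip, using Python's standard zlib module (which implements DEFLATE, the method used by gzip) as a stand-in compressor. The input bytes are an arbitrary, highly repetitive example chosen so that the compressed form is clearly smaller; the variable names are hypothetical.

```python
import zlib

original = b"ABRACADABRA! " * 100       # B: repetitive input compresses well
compressed = zlib.compress(original)    # C(B)
restored = zlib.decompress(compressed)  # expanding C(B) gives back B

size_before = len(original)
size_after = len(compressed)
```

Because the round trip returns exactly the original bytes, this is a lossless scheme; comparing size_after with size_before shows how much was saved on this particular input.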

lossless compression: get exactly B from C(B)
compression ratio: |C(B ...