Welcome to my World



Punctuating Japanese Transcripts Using the MeCab Tokenizer and a BiLSTM-CRF

Posted on 2019-11-08

Punctuating passages is a difficult task for a neural network. It gets even more challenging when the input is a spoken transcript in a language like Japanese: unlike English, Japanese has no spaces between words.
To oversimplify, these are the three main steps required for the task. I got the best results using a BiLSTM-CRF model.

  1. First, tokenize the text using a tokenizer (a sketch follows this list).

  2. Then design the input sequence with the relevant context.

  3. Do some feature engineering and make the prediction.
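As a rough illustration of step 1, here is a minimal tokenization sketch. It assumes the mecab-python3 package; the post doesn't show the actual setup, so treat the details as placeholders.

import MeCab  # pip install mecab-python3 unidic-lite

tagger = MeCab.Tagger('-Owakati')  # wakati mode: space-separated surface forms
tokens = tagger.parse('今日はいい天気ですね').strip().split()
print(tokens)  # exact split depends on the dictionary, e.g. ['今日', 'は', 'いい', '天気', 'です', 'ね']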


Deploying a Keras Model to Production on Google Cloud

Posted on 2019-06-18

Deploying a model to production is an important skill for any machine learning engineer's toolkit. I trained an MNIST model and created a drawing interface: the user draws a digit, and the model predicts which digit it is.
(Image: the app running)

I wrote an app.yaml file and deployed the app on Google Cloud using the Google Cloud SDK. The deployment succeeded, but opening the app with gcloud app browse returned a server error (possibly due to a mistake in app.yaml).
(Image: server error)
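For reference, here is a minimal app.yaml sketch for the App Engine Python standard environment. This is not the post's actual file: main:app is an assumed Flask entrypoint, and gunicorn would need to be listed in requirements.txt.

runtime: python37
entrypoint: gunicorn -b :$PORT main:app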

I trained an AI to generate r/bitcoin post titles, and it said: "I am mining in the future"

Posted on 2019-01-02

Since I was getting bored during my holidays, I figured it might be fun to train an AI on r/bitcoin post titles, and the results are amazing.
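The post doesn't say which model was used. As a hedged illustration of the general idea, here is a minimal word-level Markov-chain sketch over a hypothetical titles.txt (one title per line); the highlights below came from whatever model was actually trained.

import random
from collections import defaultdict

titles = open('titles.txt').read().lower().splitlines()

# Map each word to the words that follow it, with <s>/</s> as boundary markers.
chain = defaultdict(list)
for title in titles:
    words = ['<s>'] + title.split() + ['</s>']
    for a, b in zip(words, words[1:]):
        chain[a].append(b)

def generate():
    word, out = '<s>', []
    while True:
        word = random.choice(chain[word])
        if word == '</s>':
            return ' '.join(out)
        out.append(word)

print(generate())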

Some highlights:

When you ‘ re not holding your keys ; you are in the us government

I’ve been surviving off ramen the past few days ago .

Bitcoin cash will not take over bitcoin .

Hey, coinbase and other people to buy bitcoin .

How i feel great ! i ‘ m fast being priced out of it

Bitcoin is a tool for freeing humanity from oligarchs and tyrants , dressed up as a first time ever - bbc news

I just bought my first bitcoin investment has finally doubled ! ! ! ! ! ! ! ! !

I ‘ m a low level bitcoiner . . . .

Bitcoin is a “ firmware problem “

We are now being sold for bitcoin

I ‘ m a ucla student who started a blog explaining to millennials why bitcoin is a good idea

Bitcoin is going to eat my dick if wrong .

I ‘ m not going down !

Owning 1 bitcoin = 3 ounce of gold ! if they should rebrand to anti - bitcoin will be able to pay my waiter his tip in btc

i am a proud son .

A reminder why bitcoin is going to be a part in my mind . . . . .

We are now actively contributing code and communicated on the bitcoin network .

I am finally in the us senate want to sell bitcoins

Bitcoin is still an extremely risky and speculative investment.

Can we please activate segwit faster .

Bitcoin is at $ 3000 all time high , everyone is acting like their dog just died .

What if the bank runs out of the latest , and whom i ‘ m upgraded to be shitting himself so hard about bitcoin

Welcome to the bitcoin vs stocks vs gold vs silver

Reminder : hard fork

20 mining bashing on unlimited way to ukraine

Former fed employee fined for installing bitcoin mining

A shilling to time we got here in japan

This scam is unacceptable . Let’s getting out of that’s to this subreddit whenever bitcoin is signalling

Secure , stable and efficient smart contracts

Don ‘ t believe , these numbers aren’t fuck more than 200 crypto atms in united states vs coinbase inc.

Bitcoin is a shitcoin.

Woke up after 4 - years of people need to worry about ?

Iran just officially stopped accepting dollars for oil payments .

Wow , andreas is intelligent .

Bitcoin core dev will support segwit . maybe next time

Good news - the fighting phase has started painting bitcoin art themes , i hey yesterday , be honest .

Hence bitcoin is now signalling segwit

I just lost one third of my whole day of work function

Biggest venezuelan exchange announces its banking situation is resolved , fiat transactions

Breadwallet becomes bread , a tad bit drunk and just laughing my ass off

Spent this afternoon creating a generation of people that write the code by accident

Yesterday was my birthday and i think my neighbour is a big deal

Up 800 % this year # btc ? lol

Sessions : trump directed law enforcement and regulatory agencies

The pump has never been more obvious , and we wouldn’t give a student some good investing advice - bitcoinity usd $ 11 , 000

Segwit locked in the philippines !

Bitpay just did this ?

Thank you.

The b2x developer does not cut it anymore .

Bitcoin is a good idea

Dumps:

I will upload the dumps soon.

The AI seems to like Bitcoin and hate the US government.

Landmark Retrieval

Posted on 2018-11-28

A landmark-retrieval model.

Result after training: (image)

The Jupyter Notebook link is below; note that the notebook is not interactive.

Visit GitHub for the Jupyter Notebook of the Landmark Retrieval Model

Classic Fashion-MNIST Dataset

Posted on 2018-10-19

The original MNIST dataset contains a large number of handwritten digits. Members of the AI/ML/data science community love this dataset and use it as a benchmark to validate their algorithms. In fact, MNIST is often the first dataset researchers try. "If it doesn't work on MNIST, it won't work at all," they say. "Well, if it does work on MNIST, it may still fail on others."

To Serious Machine Learning Researchers

  1. MNIST is too easy.

  2. MNIST is overused.

  3. MNIST cannot represent modern CV tasks.

The Fashion-MNIST dataset contains 60,000 training images across 10 classes.
I was able to get an accuracy of 91.2% on this dataset.
The notebook is not interactive.

Visit GitHub for the Jupyter Notebook of the Fashion-MNIST classification model
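As a rough sketch of the kind of classifier involved (not the notebook's actual architecture, which is on GitHub above), a minimal tf.keras baseline might look like this:

import tensorflow as tf

# Fashion-MNIST ships with Keras: 60,000 training and 10,000 test images.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.fashion_mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0  # scale pixels to [0, 1]

model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10, activation='softmax'),
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.fit(x_train, y_train, epochs=5, validation_data=(x_test, y_test))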

Supervised Learning on a Very Large Dataset

Posted on 2018-09-28

Supervised learning is arguably the most important use case for machine learning, and yet its importance goes unrecognized by many researchers. Supervised learning makes up a large fraction of the work done by AI companies; being proficient in it helps a company satisfy its clients faster and earn more profit.

In this task, I worked on the Rossmann dataset, which contains sales data from Rossmann stores in Germany. The task is to predict sales for the next two weeks. Click below for the Jupyter Notebook link; the notebook is not interactive.

Visit GitHub for the Jupyter Notebook of the Rossmann dataset model
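As a hedged illustration of the task setup (not the notebook's model), here is a naive per-store baseline. It assumes a Kaggle-style train.csv with Store, Date, and Sales columns.

import pandas as pd

train = pd.read_csv('train.csv', parse_dates=['Date'])

# Naive baseline: predict a store's future sales as its historical mean.
store_means = train.groupby('Store')['Sales'].mean()

def predict_sales(store_id):
    return store_means.loc[store_id]

print(predict_sales(1))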

Image Classification, Done Successfully

Posted on 2018-09-28

Image classification is a really exciting task for machine learning. It is almost magical to watch a computer understand images by itself.

Machine learning can do image classification with very high accuracy. I worked on a dog-breeds dataset consisting of images of 120 different breeds, many of which look very similar, and was able to get very high accuracy. Click the link below for the Jupyter Notebook; it is not interactive.

Visit GitHub for the Jupyter Notebook of the dog-breed classification model
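As a sketch of how such a classifier is typically built with the fastai 0.7 API used elsewhere on this blog (the paths and hyperparameters are assumptions, not the notebook's actual values):

from fastai.conv_learner import *

PATH = 'data/dogbreeds/'  # assumed layout: train/ and valid/ folders, one subfolder per breed
sz = 224

# Fine-tune an ImageNet-pretrained ResNet-34 on the breed images.
tfms = tfms_from_model(resnet34, sz)
data = ImageClassifierData.from_paths(PATH, tfms=tfms)
learn = ConvLearner.pretrained(resnet34, data, precompute=True)
learn.fit(1e-2, 3)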

LSTM: Text Generator using PyTorch

Posted on 2018-09-27

PyTorch is an open-source machine learning library backed by Facebook. It is the hot new thing that everyone is talking about, so I tried to see for myself whether PyTorch is easier to use than TensorFlow.

We will build an LSTM text generator, working through RNN models of increasing complexity until we arrive at an LSTM.

Visit GitHub for the Jupyter Notebook of the LSTM text generator

%reload_ext autoreload
%autoreload 2
%matplotlib inline

from fastai.io import *
from fastai.conv_learner import *

from fastai.column_data import *

Setup

PATH='data/nietzsche/'
text = open(f'{PATH}nietzsche.txt').read()
print('corpus length:', len(text))
corpus length: 40000
text[:400]
'PREFACE\n\n\nSUPPOSING that Truth is a woman--what then? Is there not ground\nfor suspecting that all philosophers, in so far as they have been\ndogmatists, have failed to understand women--that the terrible\nseriousness and clumsy importunity with which they have usually paid\ntheir addresses to Truth, have been unskilled and unseemly methods for\nwinning a woman? Certainly she has never allowed herself '
chars = sorted(list(set(text)))
vocab_size = len(chars)+1
print('total chars:', vocab_size)
total chars: 76

Sometimes it’s useful to have a zero value in the dataset, e.g. for padding

chars.insert(0, "\0")

''.join(chars[1:-6])
'\n !"\'(),-.0123456789:;?ABCDEFGHIJKLMNOPQRSTUVWXY_abcdefghijklmnopqrst'
char_indices = {c: i for i, c in enumerate(chars)}
indices_char = {i: c for i, c in enumerate(chars)}
idx = [char_indices[c] for c in text]

idx[:10]
[39, 41, 28, 29, 24, 26, 28, 1, 1, 1]
''.join(indices_char[i] for i in idx[:70])
'PREFACE\n\n\nSUPPOSING that Truth is a woman--what then? Is there not gro'

Three char model

Create inputs

cs=3
c1_dat = [idx[i] for i in range(0, len(idx)-cs, cs)]
c2_dat = [idx[i+1] for i in range(0, len(idx)-cs, cs)]
c3_dat = [idx[i+2] for i in range(0, len(idx)-cs, cs)]
c4_dat = [idx[i+3] for i in range(0, len(idx)-cs, cs)]

Our inputs

x1 = np.stack(c1_dat)
x2 = np.stack(c2_dat)
x3 = np.stack(c3_dat)

Our output

y = np.stack(c4_dat)

x1.shape, y.shape
((13333,), (13333,))

Create and train model

n_hidden = 256
n_fac = 42
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
class Char3Model(nn.Module):
def __init__(self, vocab_size, n_fac):
super().__init__()
self.e = nn.Embedding(vocab_size, n_fac)

# The 'green arrow' from our diagram - the layer operation from input to hidden
self.l_in = nn.Linear(n_fac, n_hidden)

# The 'orange arrow' from our diagram - the layer operation from hidden to hidden
self.l_hidden = nn.Linear(n_hidden, n_hidden)

# The 'blue arrow' from our diagram - the layer operation from hidden to output
self.l_out = nn.Linear(n_hidden, vocab_size)

def forward(self, c1, c2, c3):
in1 = F.relu(self.l_in(self.e(c1)))
in2 = F.relu(self.l_in(self.e(c2)))
in3 = F.relu(self.l_in(self.e(c3)))

h = V(torch.zeros(in1.size()))
h = F.tanh(self.l_hidden(h+in1))
h = F.tanh(self.l_hidden(h+in2))
h = F.tanh(self.l_hidden(h+in3))

return F.log_softmax(self.l_out(h))
md = ColumnarModelData.from_arrays('.', [-1], np.stack([x1,x2,x3], axis=1), y, bs=512)
m = Char3Model(vocab_size, n_fac)
it = iter(md.trn_dl)
*xs,yt = next(it)
t = m(*V(xs))
opt = optim.Adam(m.parameters(), 1e-2)
fit(m, md, 1, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.911144   3.086874

[array([3.08687])]
set_lrs(opt, 0.001)
fit(m, md, 1, opt, F.nll_loss)

[ 0.       1.84525  6.52312]

Test model

def get_next(inp):
    idxs = T(np.array([char_indices[c] for c in inp]))
    p = m(*VV(idxs))
    i = np.argmax(to_np(p))
    return chars[i]
get_next('y. ')
'p'

get_next('ppl')
'e'

get_next(' th')
'e'

get_next('and')
' '

Our first RNN!

Create inputs

cs=8

c_in_dat = [[idx[i+j] for i in range(cs)] for j in range(len(idx)-cs)]
c_out_dat = [idx[j+cs] for j in range(len(idx)-cs)]

xs = np.stack(c_in_dat, axis=0)
xs.shape
(39992, 8)

y = np.stack(c_out_dat)

Create and train model

val_idx = get_cv_idxs(len(idx)-cs-1)
md = ColumnarModelData.from_arrays('.', val_idx, xs, y, bs=512)
class CharLoopModel(nn.Module):
    # This is an RNN!
    def __init__(self, vocab_size, n_fac):
        super().__init__()
        self.e = nn.Embedding(vocab_size, n_fac)
        self.l_in = nn.Linear(n_fac, n_hidden)
        self.l_hidden = nn.Linear(n_hidden, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, *cs):
        bs = cs[0].size(0)
        h = V(torch.zeros(bs, n_hidden))
        for c in cs:
            inp = F.relu(self.l_in(self.e(c)))
            h = F.tanh(self.l_hidden(h+inp))

        return F.log_softmax(self.l_out(h), dim=-1)
m = CharLoopModel(vocab_size, n_fac)
opt = optim.Adam(m.parameters(), 1e-2)

fit(m, md, 1, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.540714   2.313722

[array([2.31372])]
set_lrs(opt, 0.001)

fit(m, md, 1, opt, F.nll_loss)

[ 0.       1.73588  1.75103]
class CharLoopConcatModel(nn.Module):
    def __init__(self, vocab_size, n_fac):
        super().__init__()
        self.e = nn.Embedding(vocab_size, n_fac)
        self.l_in = nn.Linear(n_fac+n_hidden, n_hidden)
        self.l_hidden = nn.Linear(n_hidden, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, *cs):
        bs = cs[0].size(0)
        h = V(torch.zeros(bs, n_hidden))
        for c in cs:
            inp = torch.cat((h, self.e(c)), 1)
            inp = F.relu(self.l_in(inp))
            h = F.tanh(self.l_hidden(inp))

        return F.log_softmax(self.l_out(h), dim=-1)
m = CharLoopConcatModel(vocab_size, n_fac)
opt = optim.Adam(m.parameters(), 1e-3)

it = iter(md.trn_dl)
*xs,yt = next(it)
t = m(*V(xs))

fit(m, md, 1, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.955463   2.684819

[array([2.68482])]
set_lrs(opt, 1e-4)

fit(m, md, 1, opt, F.nll_loss)

[ 0.       1.69008  1.69936]

Test model

def get_next(inp):
    idxs = T(np.array([char_indices[c] for c in inp]))
    p = m(*VV(idxs))
    i = np.argmax(to_np(p))
    return chars[i]

get_next('for thos')
' '

get_next('part of ')
't'

get_next('queens a')
'n'

RNN with PyTorch

class CharRnn(nn.Module):
    def __init__(self, vocab_size, n_fac):
        super().__init__()
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.RNN(n_fac, n_hidden)  # replaces the hand-written loop above
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, *cs):
        bs = cs[0].size(0)
        h = V(torch.zeros(1, bs, n_hidden))
        inp = self.e(torch.stack(cs))
        outp,h = self.rnn(inp, h)

        return F.log_softmax(self.l_out(outp[-1]), dim=-1)
m = CharRnn(vocab_size, n_fac)
opt = optim.Adam(m.parameters(), 1e-3)

it = iter(md.trn_dl)
*xs,yt = next(it)

t = m.e(V(torch.stack(xs)))
t.size()
torch.Size([8, 512, 42])

ht = V(torch.zeros(1, 512, n_hidden))
outp, hn = m.rnn(t, ht)
outp.size(), hn.size()
(torch.Size([8, 512, 256]), torch.Size([1, 512, 256]))

t = m(*V(xs)); t.size()
torch.Size([512, 76])
fit(m, md, 4, opt, F.nll_loss)

[ 0.       1.86065  1.84255]
[ 1.       1.68014  1.67387]
[ 2.       1.58828  1.59169]
[ 3.       1.52989  1.54942]
set_lrs(opt, 1e-4)

fit(m, md, 2, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.92928    2.668219
    1      2.611671   2.442238

[array([2.44224])]

Test model

def get_next(inp):
    idxs = T(np.array([char_indices[c] for c in inp]))
    p = m(*VV(idxs))
    i = np.argmax(to_np(p))
    return chars[i]

get_next('for thos')
' '
def get_next_n(inp, n):
    res = inp
    for i in range(n):
        c = get_next(inp)
        res += c
        inp = inp[1:]+c
    return res

get_next_n('for thos', 40)
'for thos the the the the the the the the the the'

Multi-output model

Setup

Let’s take non-overlapping sets of characters this time

c_in_dat = [[idx[i+j] for i in range(cs)] for j in range(0, len(idx)-cs-1, cs)]

Then create the exact same thing, offset by 1, as our labels

c_out_dat = [[idx[i+j] for i in range(cs)] for j in range(1, len(idx)-cs, cs)]

xs = np.stack(c_in_dat)
xs.shape
(4999, 8)

ys = np.stack(c_out_dat)
ys.shape
(4999, 8)
xs[:cs,:cs]
array([[39, 41, 28, 29, 24, 26, 28,  1],
       [ 1,  1, 42, 44, 39, 39, 38, 42],
       [32, 37, 30,  2, 69, 57, 50, 69],
       [ 2, 43, 67, 70, 69, 57,  2, 58],
       [68,  2, 50,  2, 72, 64, 62, 50],
       [63,  9,  9, 72, 57, 50, 69,  2],
       [69, 57, 54, 63, 23,  2, 32, 68],
       [ 2, 69, 57, 54, 67, 54,  2, 63]])
ys[:cs,:cs]
array([[41, 28, 29, 24, 26, 28,  1,  1],
       [ 1, 42, 44, 39, 39, 38, 42, 32],
       [37, 30,  2, 69, 57, 50, 69,  2],
       [43, 67, 70, 69, 57,  2, 58, 68],
       [ 2, 50,  2, 72, 64, 62, 50, 63],
       [ 9,  9, 72, 57, 50, 69,  2, 69],
       [57, 54, 63, 23,  2, 32, 68,  2],
       [69, 57, 54, 67, 54,  2, 63, 64]])

Create and train model

val_idx = get_cv_idxs(len(xs)-cs-1)
md = ColumnarModelData.from_arrays('.', val_idx, xs, ys, bs=512)
class CharSeqRnn(nn.Module):
    def __init__(self, vocab_size, n_fac):
        super().__init__()
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.RNN(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)

    def forward(self, *cs):
        bs = cs[0].size(0)
        h = V(torch.zeros(1, bs, n_hidden))
        inp = self.e(torch.stack(cs))
        outp,h = self.rnn(inp, h)
        # Unlike CharRnn, predict at every timestep rather than only the last one.
        return F.log_softmax(self.l_out(outp), dim=-1)
m = CharSeqRnn(vocab_size, n_fac)
opt = optim.Adam(m.parameters(), 1e-3)

it = iter(md.trn_dl)
*xst,yt = next(it)
def nll_loss_seq(inp, targ):
    # inp is (seq_len, batch, vocab); flatten both dims so F.nll_loss can handle it.
    sl,bs,nh = inp.size()
    targ = targ.transpose(0,1).contiguous().view(-1)
    return F.nll_loss(inp.view(-1,nh), targ)
fit(m, md, 4, opt, nll_loss_seq)

[ 0.       2.59241  2.40251]
[ 1.       2.28474  2.19859]
[ 2.       2.13883  2.08836]
[ 3.       2.04892  2.01564]
set_lrs(opt, 1e-4)

fit(m, md, 1, opt, nll_loss_seq)

epoch      trn_loss   val_loss
    0      4.32723    4.291134

[array([4.29113])]

Identity init!

Initializing the hidden-to-hidden weights to the identity matrix (an idea from Le, Jaitly and Hinton's IRNN paper) gives the recurrent layer a stable starting point.

m = CharSeqRnn(vocab_size, n_fac)
opt = optim.Adam(m.parameters(), 1e-2)

m.rnn.weight_hh_l0.data.copy_(torch.eye(n_hidden))
    1     0     0  ...      0     0     0
    0     1     0  ...      0     0     0
    0     0     1  ...      0     0     0
       ...          ⋱          ...       
    0     0     0  ...      1     0     0
    0     0     0  ...      0     1     0
    0     0     0  ...      0     0     1
[torch.FloatTensor of size 256x256]
fit(m, md, 4, opt, nll_loss_seq)

epoch      trn_loss   val_loss
    0      3.490897   3.1025
    1      3.235987   2.869533
    2      3.063832   2.690163
    3      2.929215   2.572571

[array([2.57257])]
set_lrs(opt, 1e-3)

fit(m, md, 4, opt, nll_loss_seq)

[ 0.       1.84035  1.85742]
[ 1.       1.82896  1.84887]
[ 2.       1.81879  1.84281]
[ 3.       1.81337  1.83801]

Stateful model

Setup

from torchtext import vocab, data

from fastai.nlp import *
from fastai.lm_rnn import *

PATH='data/nietzsche/'

TRN_PATH = 'trn/'
VAL_PATH = 'val/'
TRN = f'{PATH}{TRN_PATH}'
VAL = f'{PATH}{VAL_PATH}'

%ls {PATH}
nietzsche.txt       nietzschesmall.txt  trn/                val/

%ls {PATH}trn
nietzschesmall.txt
TEXT = data.Field(lower=True, tokenize=list)
bs=64; bptt=8; n_fac=42; n_hidden=256

FILES = dict(train=TRN_PATH, validation=VAL_PATH, test=VAL_PATH)
md = LanguageModelData.from_text_files(PATH, TEXT, **FILES, bs=bs, bptt=bptt, min_freq=3)

len(md.trn_dl), md.nt, len(md.trn_ds), len(md.trn_ds[0].text)
(75, 45, 1, 39383)

RNN

class CharSeqStatefulRnn(nn.Module):
    def __init__(self, vocab_size, n_fac, bs):
        super().__init__()
        self.vocab_size = vocab_size
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.RNN(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)
        self.init_hidden(bs)

    def forward(self, cs):
        bs = cs[0].size(0)
        if self.h.size(1) != bs: self.init_hidden(bs)
        outp,h = self.rnn(self.e(cs), self.h)
        # Keep the hidden state between batches, but detach its history (truncated BPTT).
        self.h = repackage_var(h)
        return F.log_softmax(self.l_out(outp), dim=-1).view(-1, self.vocab_size)

    def init_hidden(self, bs): self.h = V(torch.zeros(1, bs, n_hidden))
m = CharSeqStatefulRnn(md.nt, n_fac, 512)
opt = optim.Adam(m.parameters(), 1e-3)

fit(m, md, 4, opt, F.nll_loss)

[ 0.       1.81983  1.81247]
[ 1.       1.63097  1.66228]
[ 2.       1.54433  1.57824]
[ 3.       1.48563  1.54505]
set_lrs(opt, 1e-4)

fit(m, md, 1, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      3.223129   2.929459

[array([2.92946])]

RNN loop

# From the pytorch source

def RNNCell(input, hidden, w_ih, w_hh, b_ih, b_hh):
    return F.tanh(F.linear(input, w_ih, b_ih) + F.linear(hidden, w_hh, b_hh))
class CharSeqStatefulRnn2(nn.Module):
    def __init__(self, vocab_size, n_fac, bs):
        super().__init__()
        self.vocab_size = vocab_size
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.RNNCell(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)
        self.init_hidden(bs)

    def forward(self, cs):
        bs = cs[0].size(0)
        if self.h.size(1) != bs: self.init_hidden(bs)
        outp = []
        o = self.h
        for c in cs:
            o = self.rnn(self.e(c), o)
            outp.append(o)
        outp = self.l_out(torch.stack(outp))
        self.h = repackage_var(o)
        return F.log_softmax(outp, dim=-1).view(-1, self.vocab_size)

    def init_hidden(self, bs): self.h = V(torch.zeros(1, bs, n_hidden))
m = CharSeqStatefulRnn2(md.nt, n_fac, 512)
opt = optim.Adam(m.parameters(), 1e-3)

fit(m, md, 4, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.634824   2.435599
    1      2.397607   2.288882
    2      2.261155   2.169498
    3      2.158415   2.082467

[array([2.08247])]

GRU

class CharSeqStatefulGRU(nn.Module):
    def __init__(self, vocab_size, n_fac, bs):
        super().__init__()
        self.vocab_size = vocab_size
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.GRU(n_fac, n_hidden)
        self.l_out = nn.Linear(n_hidden, vocab_size)
        self.init_hidden(bs)

    def forward(self, cs):
        bs = cs[0].size(0)
        if self.h.size(1) != bs: self.init_hidden(bs)
        outp,h = self.rnn(self.e(cs), self.h)
        self.h = repackage_var(h)
        return F.log_softmax(self.l_out(outp), dim=-1).view(-1, self.vocab_size)

    def init_hidden(self, bs): self.h = V(torch.zeros(1, bs, n_hidden))
# From the pytorch source code - for reference

def GRUCell(input, hidden, w_ih, w_hh, b_ih, b_hh):
    gi = F.linear(input, w_ih, b_ih)
    gh = F.linear(hidden, w_hh, b_hh)
    i_r, i_i, i_n = gi.chunk(3, 1)
    h_r, h_i, h_n = gh.chunk(3, 1)

    resetgate = F.sigmoid(i_r + h_r)
    inputgate = F.sigmoid(i_i + h_i)
    newgate = F.tanh(i_n + resetgate * h_n)
    return newgate + inputgate * (hidden - newgate)
m = CharSeqStatefulGRU(md.nt, n_fac, 512)

opt = optim.Adam(m.parameters(), 1e-3)

fit(m, md, 6, opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.667542   2.425069
    1      2.371198   2.23098
    2      2.207084   2.096106
    3      2.081467   1.982581
    4      1.971633   1.877548
    5      1.877026   1.794282

[array([1.79428])]
set_lrs(opt, 1e-4)

fit(m, md, 3, opt, F.nll_loss)

[ 0.       1.22708  1.36926]
[ 1.       1.21948  1.3696 ]
[ 2.       1.22541  1.36969]

Putting it all together: LSTM

from fastai import sgdr

n_hidden=512
class CharSeqStatefulLSTM(nn.Module):
    def __init__(self, vocab_size, n_fac, bs, nl):
        super().__init__()
        self.vocab_size,self.nl = vocab_size,nl
        self.e = nn.Embedding(vocab_size, n_fac)
        self.rnn = nn.LSTM(n_fac, n_hidden, nl, dropout=0.5)
        self.l_out = nn.Linear(n_hidden, vocab_size)
        self.init_hidden(bs)

    def forward(self, cs):
        bs = cs[0].size(0)
        if self.h[0].size(1) != bs: self.init_hidden(bs)
        outp,h = self.rnn(self.e(cs), self.h)
        self.h = repackage_var(h)
        return F.log_softmax(self.l_out(outp), dim=-1).view(-1, self.vocab_size)

    def init_hidden(self, bs):
        # An LSTM carries two states: the hidden state and the cell state.
        self.h = (V(torch.zeros(self.nl, bs, n_hidden)),
                  V(torch.zeros(self.nl, bs, n_hidden)))
m = CharSeqStatefulLSTM(md.nt, n_fac, 512, 2)
lo = LayerOptimizer(optim.Adam, m, 1e-2, 1e-5)

os.makedirs(f'{PATH}models', exist_ok=True)

fit(m, md, 2, lo.opt, F.nll_loss)

epoch      trn_loss   val_loss
    0      2.519296   2.222408
    1      2.186267   1.941648

[array([1.94165])]
on_end = lambda sched, cycle: save_model(m, f'{PATH}models/cyc_{cycle}')
cb = [CosAnneal(lo, len(md.trn_dl), cycle_mult=2, on_cycle_end=on_end)]
fit(m, md, 1, lo.opt, F.nll_loss, callbacks=cb)

epoch      trn_loss   val_loss
    0      1.616982   1.337493

[array([1.33749])]
on_end = lambda sched, cycle: save_model(m, f'{PATH}models/cyc_{cycle}')
cb = [CosAnneal(lo, len(md.trn_dl), cycle_mult=2, on_cycle_end=on_end)]
fit(m, md, 2**6-1, lo.opt, F.nll_loss, callbacks=cb)
[ 0.       1.46053  1.43462]                                 
[ 1.       1.51537  1.47747]                                 
[ 2.       1.39208  1.38293]                                 
[ 3.       1.53056  1.49371]                                 
[ 4.       1.46812  1.43389]                                 
[ 5.       1.37624  1.37523]                                 
[ 6.       1.3173   1.34022]                                 
[ 7.       1.51783  1.47554]                                 
[ 8.       1.4921   1.45785]                                 
[ 9.       1.44843  1.42215]                                 
[ 10.        1.40948   1.40858]                              
[ 11.        1.37098   1.36648]                              
[ 12.        1.32255   1.33842]                              
[ 13.        1.28243   1.31106]                              
[ 14.        1.25031   1.2918 ]                              
[ 15.        1.49236   1.45316]                              
[ 16.        1.46041   1.43622]                              
[ 17.        1.45043   1.4498 ]                              
[ 18.        1.43331   1.41297]                              
[ 19.        1.43841   1.41704]                              
[ 20.        1.41536   1.40521]                              
[ 21.        1.39829   1.37656]                              
[ 22.        1.37001   1.36891]                              
[ 23.        1.35469   1.35909]                              
[ 24.        1.32202   1.34228]                              
[ 25.        1.29972   1.32256]                              
[ 26.        1.28007   1.30903]                              
[ 27.        1.24503   1.29125]                              
[ 28.        1.22261   1.28316]                              
[ 29.        1.20563   1.27397]                              
[ 30.        1.18764   1.27178]                              
[ 31.        1.18114   1.26694]                              
[ 32.        1.44344   1.42405]                              
[ 33.        1.43344   1.41616]                              
[ 34.        1.4346    1.40442]                              
[ 35.        1.42152   1.41359]                              
[ 36.        1.42072   1.40835]                              
[ 37.        1.41732   1.40498]                              
[ 38.        1.41268   1.395  ]                              
[ 39.        1.40725   1.39433]                              
[ 40.        1.40181   1.39864]                              
[ 41.        1.38621   1.37549]                              
[ 42.        1.3838    1.38587]                              
[ 43.        1.37644   1.37118]                              
[ 44.        1.36287   1.36211]                              
[ 45.        1.35942   1.36145]                              
[ 46.        1.34712   1.34924]                              
[ 47.        1.32994   1.34884]                              
[ 48.        1.32788   1.33387]                              
[ 49.        1.31553   1.342  ]                              
[ 50.        1.30088   1.32435]                              
[ 51.        1.28446   1.31166]                              
[ 52.        1.27058   1.30807]                              
[ 53.        1.26271   1.29935]                              
[ 54.        1.24351   1.28942]                              
[ 55.        1.23119   1.2838 ]                              
[ 56.        1.2086    1.28364]                              
[ 57.        1.19742   1.27375]                              
[ 58.        1.18127   1.26758]                              
[ 59.        1.17475   1.26858]                              
[ 60.        1.15349   1.25999]                              
[ 61.        1.14718   1.25779]                              
[ 62.        1.13174   1.2524 ]                              

Test

def get_next(inp):
    idxs = TEXT.numericalize(inp, device=-1)
    p = m(VV(idxs.transpose(0,1)))
    # Sample from the predicted distribution instead of always taking the argmax.
    r = torch.multinomial(p[-1].exp(), 1)
    return TEXT.vocab.itos[to_np(r)[0]]

get_next('for thos')
'a'
def get_next_n(inp, n):
    res = inp
    for i in range(n):
        c = get_next(inp)
        res += c
        inp = inp[1:]+c
    return res

print(get_next_n('for thos', 400))
for thosal hunscholarigines--ors antion  of that the undressent--and asturebpessomethertogether; and wills. got mo all,watherein perhaps-begaprediced than it will to uppersion icceptameths of the truented term stylless belief your, the subtle<unk> a do long amatiour one could words to cognize of the philosophy, his fain bus-doorg as lesapertion in in shhope conscial or "invermancertay they awayge part of his

Life Hacks: The Start

Posted on 2018-09-27

There are a lot of tasks that can be automated with programming. Work that usually takes hours, if not days, can be done in minutes. Life Hacks explores some of those tasks and tries to automate them.

Every program is written so that it can easily be modified for a wide range of applications. This is extremely important, because writing a new program for every different task (which is usually what's needed) takes a long time.

Please contact me by email if there are any bugs.

Life Hacks: Lucky Google

Posted on 2018-09-27

Google is one of the most frequently visited websites on the Internet, and people have made big gains by tweaking the way they use it. The New York Times, for example, lets people read most articles if they arrive from a Google Search link, which saves $10 per month. We will mess around with Google ourselves.

In this program, we will write lucky.py, which automatically opens the top 5 Google search results in new tabs.

We import the following packages:

  1. requests - gives us some of the functionality of a browser
  2. sys - lets us read the search query from the command-line arguments
  3. webbrowser - lets us open the browser
  4. bs4 - helps us find the required HTML tags

Steps:

  1. Get the search query using sys.
  2. Use bs4 to find the URLs of the top 5 result links.
  3. Use webbrowser to open those links.
import requests, sys, webbrowser, bs4

print('Googling....')
res = requests.get('http://google.com/search?q=' + '+'.join(sys.argv[1:]))
res.raise_for_status()  # stop early if the request failed

soup = bs4.BeautifulSoup(res.text, "html.parser")
linkElems = soup.select('.r a')  # result links sit inside elements with class "r"

numOpen = min(5, len(linkElems))

# Open the search page itself, then the top results, each in a new tab.
webbrowser.open('http://google.com/search?q=' + '+'.join(sys.argv[1:]))
for i in range(numOpen):
    webbrowser.open('http://google.com' + linkElems[i].get('href'))

Sahil Bansal

© 2019 Sahil Bansal