Jimmy (xiaoke) Shen

247 Followers


Mar 14

GPT-4 is coming!

Paper | Blog [Official]: GPT-4 is OpenAI’s most advanced system, producing safer and more useful responses.
What is new in GPT-4: it supports images as input. It is a “multimodal model which can accept image and text inputs and produce text outputs.”
What has not changed: the output is still text, and the architecture is still Transformer-based. Details of the architecture are not available: “Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size)…

Gpt 4

2 min read


Mar 14

Update docker containers’ default storage location on Ubuntu

The storage location of Docker images and containers [1]: A Docker container consists of network settings, volumes, and images. The location of Docker files depends on your operating system; on Ubuntu it is /var/lib/docker/.
Change storage location [2]: Another frequently posted solution is to stop Docker with sudo systemctl stop docker, then edit the /etc/default/docker file by adding the new location with the -g option in the DOCKER_OPTS line:

1 min read



Mar 10

Tutorials of Git and Git internal

Basic tutorials: There are plenty of git tutorials and guides that help you get started and explain the basic concepts. This post is not one of them. If that is what you need, you can check these great resources:
  • git - the simple guide
  • Git is a Directed Acyclic Graph and What the Heck Does That Mean?
  • Think Like (a) Git
  • Official documentation
There is another quite popular resource that doesn’t focus on explaining the concepts, but on what to do if you run into certain cases (aka problems):

Git

1 min read



Mar 3

pd datetime

Basic dataframe:
import pandas as pd
import numpy as np
dates = [str(y)+"-"+str(m)+"-"+str(d) for y in range(2000, 2001) for m in range(1, 13) for d in range(1, 20)]
df = pd.DataFrame({"date": dates})
Change string to datetime:
>>> df["date_datetime"] = pd.to_datetime(df["date"])
>>> df.dtypes
date                     object
date_datetime    datetime64[ns]
Set a missing value:
>>> df.iloc[0, 1] = np.nan
>>> df…
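The excerpt stops right after a missing value is set. As a minimal sketch of what that step does (not the author's continuation), the snippet below rebuilds a similar frame, converts the strings, and checks that the blanked cell becomes NaT. The zero-padded date strings and the use of pd.NaT are assumptions made here for clarity.

```python
import pandas as pd

# Assumption: zero-padded ISO dates for year 2000, days 1..19 of each month.
dates = [f"2000-{m:02d}-{d:02d}" for m in range(1, 13) for d in range(1, 20)]
df = pd.DataFrame({"date": dates})

# Convert the string column to a datetime column.
df["date_datetime"] = pd.to_datetime(df["date"])

# Blank out the first datetime cell; pandas stores it as NaT.
df.iloc[0, 1] = pd.NaT

n_missing = int(df["date_datetime"].isna().sum())
first_valid = df["date_datetime"].dropna().iloc[0]
print(n_missing, first_valid)
```

The column keeps its datetime dtype after the assignment, so datetime accessors such as .dt.month still work on the remaining rows.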

Pandas

2 min read



Mar 3

Pandas internal

A deep understanding of pandas. What is pandas? Pandas is a popular package used for processing data. In this article, some useful tutorials on pandas internals are introduced to help data scientists and engineers gain a deeper understanding of this package. pandas is a Python package that provides fast, flexible, and expressive data structures designed…

Pandas Internal

3 min read



Mar 1

Multimodal Large Language Model (MLLM)

A recent paper by Microsoft is titled “Language Is Not All You Need: Aligning Perception with Language Models”. It recently (Feb 27, 2023) became available on arXiv. The paper is well written, and I highly recommend reading it yourself. One example I want to highlight here is below: Isn’t this is…

Multimodal

2 min read



Feb 28

Exploration of the blog “Why Python is Slow: Looking Under the Hood”

“Why Python is Slow: Looking Under the Hood” is an interesting blog post that can be found here. In this article, I’d like to reproduce some of its interesting observations. Number of references for integers: the original code from [1] import matplotlib.pyplot as plt…
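The excerpt is cut off before the original code, so the following is only a rough sketch of how reference counts for integers can be inspected, not the code from [1]. It assumes CPython; the helper name int_refcounts is hypothetical, and the reported numbers differ between interpreter versions (recent CPython releases report very large counts for small, cached integers).

```python
import sys

def int_refcounts(limit=20):
    """Return {n: reference count} for the integers 0..limit-1 (CPython-specific)."""
    # sys.getrefcount reports one extra reference held by its own argument.
    return {n: sys.getrefcount(n) for n in range(limit)}

counts = int_refcounts()
for n, c in counts.items():
    print(n, c)
```

Small integers such as 0 and 1 are shared across the whole interpreter, which is why their counts are typically far higher than those of freshly created objects.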

Python

2 min read



Feb 27

Cool blogs

In this article, I’d like to list some very cool blogs related to software engineering, data science, machine learning, etc.:
  • Guido van Rossum
  • Stefan Behnel
  • Serokell
  • https://lobste.rs
  • Laurence Tratt
  • Jake VanderPlas
  • Marc Garcia
  • Yinwang (in Chinese)
  • Yinwang English Wiki
I will keep updating this list. If you find any interesting blogs, please leave me a comment and I will update it accordingly. Thanks!

Machine Learning

1 min read



Feb 24

Dynamically building the heap

Question: IPO. The naive solution got TLE (time limit exceeded). Its time complexity is about O(k*m), where m is the number of unique profits. The basic idea is to organize the data by profit and, in each round, check from higher profit to lower profit to harvest the highest-profit project. class Solution { public: int findMaximizedCapital(int k, int w…
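The post's title points at building the heap dynamically instead of rescanning all profits each round. A minimal Python sketch of that idea follows; it is not the author's C++ solution. The excerpt is truncated after `int k, int w`, so the parameter names profits and capital and the function name find_maximized_capital are assumptions. Projects are sorted by required capital, every project that becomes affordable is pushed onto a max-heap of profits, and the best one is popped each round, for roughly O(n log n) time overall.

```python
import heapq

def find_maximized_capital(k, w, profits, capital):
    """Pick at most k projects, starting with capital w, to maximize final capital."""
    # Sort projects by the capital needed to start them.
    projects = sorted(zip(capital, profits))
    affordable = []  # max-heap of profits, stored negated for heapq's min-heap
    i = 0
    for _ in range(k):
        # Grow the heap with every project that has become affordable.
        while i < len(projects) and projects[i][0] <= w:
            heapq.heappush(affordable, -projects[i][1])
            i += 1
        if not affordable:
            break  # nothing can be started with the current capital
        w -= heapq.heappop(affordable)  # subtracting a negated profit adds it
    return w

print(find_maximized_capital(2, 0, [1, 2, 3], [0, 1, 1]))  # 4
```

Each project enters and leaves the heap at most once, which is what removes the O(k*m) rescanning cost of the naive approach.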

Heap

2 min read



Feb 20

Why Haskell and How to Learn?

In this article, I’d like to collect some links on why Haskell is interesting and why we should learn it.
Reference
[1] 15 Resources to Help You Learn Haskell in 2023
[2] Python VS Haskell
[3] A Pythonista’s Review of Haskell
[4] Haskell for python developers
[5] What can Python learn from Haskell? [video]
[6] Why learning Haskell/Python makes you a worse programmer
[7] Technical basics series: From Python to Haskell

Haskell

1 min read



Data Scientist/MLE/SWE @takemobi
