Deploying Serge on GCP: A Step-by-Step Guide

3 minute read

Published: March 27, 2023

Serge is a chat interface that allows you to run Alpaca models without the need for any API keys. It is entirely self-hosted, fits on 4GB of RAM, and can run on the CPU. In this guide, we’ll walk you through the steps required to deploy Serge on Google Cloud Platform (GCP).

Before we get started, make sure you have a GCP account set up and are familiar with the basic concepts of GCP, including creating and managing virtual machines, and setting up firewall rules.

Step 1: Create a Virtual Machine

The first step in deploying Serge on GCP is to create a virtual machine. We recommend using a machine type with at least 4GB of RAM and a CPU. You can choose any operating system you like, but I recommend using a Linux-based distribution such as Ubuntu or Debian.

To create a virtual machine in GCP, follow these steps:

Go to console.cloud.google.com and create a new project (eg. serge)
Navigate to the Compute Engine section and click on the Create Instance button
Choose the machine type you want to use, and choose the region and the zone for your instance. In this example, we are picking the us-central1 region and us-central1-c zone. Also, we are gonna use a e2-standard-8 machine so that we can run models up to 30 billion parameters:

Select the operating system you want to run and configure the disk space. In this example, we are gonna use Ubuntu 22.04 LTS for the OS and a 250GB SSD as our boot disk. Click on the SELECT button:

Configure the firewall to allow HTTP/HTTPS traffic:

Leave everything else as it is and click on the CREATE button to create your virtual machine
Navigate to VPC networks and click on default
From the sidebar menu click on Firewall and then click on the Create a firewall rule button
Choose a name (eg serge-firewall) and on the Target tags field fill in the http-server tag. In the Source IPv4 ranges enable all by filling in 0.0.0.0/0. In the Protocols and ports section click on TCP and open port 8008 (we will need it for later). Click on the SAVE button:

Step 2: Install Dependencies

To get started with serge, follow these steps:

Navigate to VM Instances and click on the SSH button

Clone the serge repository:

git clone https://github.com/nsarrazin/serge.git
cd serge

Start the Docker container:
```
sudo docker-compose up -d
```

Download the required tokenizers:

sudo docker-compose exec serge python3 /usr/src/app/api/utils/download.py tokenizer 7B
sudo docker-compose exec serge python3 /usr/src/app/api/utils/download.py tokenizer 13B
sudo docker-compose exec serge python3 /usr/src/app/api/utils/download.py tokenizer 30B

Please note that the models occupy the following storage space: 7B requires 4.21G, 13B requires 8.14G, and 30B requires 20.3G

Step 3: Access the API

Once you’ve installed the dependencies and started the Docker container, you can access the serge API by following these steps:

Navigate to VM Instances and copy the External IP of your machine
Open your web browser and navigate to http://external-ip:8008/ and Voilà:

You should see the serge homepage, which means that the API is up and running!
To use the API, make requests to http://external-ip:8008/api/:

That’s it! You’re now ready to use serge on GCP. Happy coding!

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

My Google Cloud Digital Leader Journey 🚀

less than 1 minute read

Published: March 15, 2024

CloudDigitalleader-v4

Thrilled to share I’ve passed the Google Cloud Digital Leader exam again! This achievement 🏆 isn’t just about mastering cloud tech but also marks a step forward in my cloud computing journey.

The Google Cloud Digital Leader certification is crucial for those looking to showcase their skills in using Google Cloud to innovate and solve business challenges 🌐. It covers essential cloud concepts and Google Cloud services, emphasizing practical solutions aligned with Google Cloud best practices.

Preparing for this beta exam was a unique challenge, given its fresh content and untested questions. It demanded a deep dive into Google Cloud’s vast resources, hands-on practice, and engaging with the cloud community for insights.

Achieving this certification again reinforces my dedication to staying at the forefront of cloud technology and sharing my journey and insights with you all through my blog and website. Let’s continue exploring the cloud together! ☁️✨

🎉🌥️ Breaking News: The Clouds Have Spoken! 🌥️🎉

1 minute read

Published: February 21, 2024

top-cloud-computing-voice

In an unprecedented turn of events, the skies have cleared, and I just got the top cloud computing voice badge on LinkedIn! 🏅✨

It seems I’ve talked about the cloud so much that even the Cloud has noticed. ☁️🗣️

So, what does this mean? Will I start influencing the weather? Predicting rain with a mere glance at my server stats? Sending lightning-fast data transfers with a snap of my fingers? Only time will tell… ⚡️💻

But in all seriousness, I’m incredibly honored and humbled to receive this recognition. It’s a testament not just to my passion for cloud computing but to the vibrant community of professionals on LinkedIn who share, engage, and support each other’s growth. 🌱🤝

I promise to use this voice for good: sharing insights, demystifying the cloud for all, and maybe, just maybe, making the internet a slightly better (and funnier) place.

To all my fellow cloud enthusiasts, let’s keep the conversation going! 🌈☁️

And a fun fact to wrap this up: this celebratory post was actually written by ChatGPT. Even in celebrating cloud achievements, AI is here to lend a hand (or a word)! 🤖✍️

#CloudComputing #LinkedInBadge #TopVoice

The Game Changing Impact of Google Cloud Skills Boost Leaderboards 🚀

3 minute read

Published: February 19, 2024

Google Cloud Skills Boost Leaderboards - Image Generated by DALL-e Google Cloud Skills Boost - Image Generated by DALL-E

The advent of gamification in learning has transformed the educational landscape, making the acquisition of new skills not just a necessity but an engaging, interactive journey. Among the forefront of this innovative approach is the Google Cloud Skills Boost platform, renowned for its incorporation of a competitive yet educational leaderboard system. This system not only motivates learners but also adds a dynamic layer of interactivity to the learning process.

Understanding Google Cloud Skills Boost

Google Cloud Skills Boost stands as a beacon for cloud learning enthusiasts, offering an expansive range of courses, labs, and quizzes tailored to elevate one’s expertise in Google Cloud technologies. The platform is designed to cater to various learning objectives, from beginners seeking foundational knowledge to professionals aiming to hone their skills. Through participation in different activities, learners accumulate points, fostering a tangible sense of progress and achievement.

Google Cloud Skills Boost Promotion to Silver League 2024-01-09 My promotion to the Silver League on Google Cloud Skills Boost

The Mechanics of Gamification in Learning

Gamification taps into the human psyche by stimulating the reward centers of the brain, thus encouraging competition and instilling a profound sense of accomplishment among learners. This psychological underpinning is what makes Google Cloud Skills Boost particularly effective. As participants engage with the material, completing courses and excelling in quizzes, they earn points and advance through leagues — from Bronze to the coveted Diamond level — mirroring a real-world progression system that keeps learners motivated and engaged.

The Role of Leaderboards

At the heart of the platform’s gamification strategy lies the leaderboard system. Each learner is placed within a cohort of 30, fostering a healthy competitive environment that drives continuous improvement. The weekly leaderboard challenges not only encourage consistent learning but also celebrate the top performers who advance to higher leagues, thereby enhancing the learning experience through competition and recognition.

Google Cloud Skills Boost Gold Leaderboard - 2024-01-27 Standing at the top of the Gold Leaderboard

Inclusivity and Flexibility in Learning

Google Cloud Skills Boost is commendable for its inclusive approach to learning. The platform ensures that taking breaks does not penalize learners; instead, it offers a system where progress can be paused, preventing any loss in league standings or leaderboard positions. Moreover, the use of randomized player names safeguards privacy and promotes anonymity, making the learning experience safe and inclusive for all participants.

Google Cloud Skills Boost Promotion to Gold League 2024-01-17 My promotion to the Gold League on Google Cloud Skills Boost

The Impact on Learning and Engagement

My personal journey with Google Cloud Skills Boost led to the achievement of two certifications, underscoring the platform’s efficacy in facilitating skill acquisition and professional development. The gamified approach has shown to significantly boost learning outcomes and engagement levels. The option to opt-out respects user preferences, highlighting the platform’s commitment to user-centric learning. Embracing the philosophy of improving “just 1% per day” can exponentially enhance one’s abilities over time, embodying the essence of continuous growth and development.

Google Cloud Skills Boost Silver Leaderboard - 2024-01-10 Leading the Silver Leaderboard

Conclusion:

The gamification of learning, as exemplified by Google Cloud Skills Boost, is revolutionizing the way we approach education and professional development. By making learning an engaging, competitive, and rewarding process, these platforms are setting a new standard for educational experiences. As we look forward, the potential of such systems to adapt and thrive in various learning environments promises a future where acquiring new skills is not just beneficial but a genuinely enjoyable pursuit. 🌟

Google Cloud Skills Boost Data Engineer Learning Path The Data Engineer Learning Path that helped me advance

Navigating Cloud Security: Insights from the Google Cloud H1 2024 Threat Horizons Report

1 minute read

Published: February 16, 2024

dall-e-unveiling-the-hidden-dangers-of-cloud-security

In the rapidly evolving cyber world, the first half of 2024 has highlighted crucial vulnerabilities within cloud configurations. Cryptomining, leveraging weak cloud setups, remains a dominant threat, underscoring the urgent need for robust security measures. This period has also seen a rise in ransomware attacks and data theft, challenging organizations to reinforce their defenses to safeguard their cloud environments.

Logging practices have emerged as a beacon of hope, offering illuminating insights into potential breaches and abnormal activities. Proper log management is not just a tool but a necessity for early detection and mitigation of threats.

Moreover, the landscape is increasingly being shaped by Advanced Persistent Threat (APT) actors, particularly those linked to the People’s Republic of China, who are targeting cloud infrastructures with sophisticated strategies. These actors exploit vulnerabilities to conduct espionage, data theft, and other malicious activities, highlighting the critical importance of vigilant cloud security measures.

The report calls for an integrated approach to cloud security, emphasizing the importance of continuous monitoring, advanced threat detection, and the implementation of best practices. By staying informed and prepared, businesses can navigate the complex security challenges of the cloud and protect their digital assets against the sophisticated threats of today and tomorrow.

For a detailed exploration of the emerging threats and strategic recommendations, visit Threat Horizons Report H1 2024.

Stay ahead, stay secure.

Paraskevas K. Leivadaros