Vista Normal

Hay nuevos artículos disponibles. Pincha para refrescar la página.
AnteayerA step up from a home lab

Recommendations on how to configure my homelab (this is a cross post from learningml)

I am looking for some recommendations on how to set up my homelab. Specifically with software/technologies

I have:

3x R630s with 512GB each and 44t/88c

1x R730 with 384GB 36c/72t and a 42x16TB drive JBOD DAS array attached, a 4x NVME 2TB pcie card, and a GTX1660 (currently running unraid, but might change that)

1x R420 with 96GB RAM and 32c/64t cpus (I think)

1x C4140 with 16c/32t, 256GB ram, and 4x P100 GPUs (just bought V100s to replace)

All servers have Connectx3 cards in them (40G/56G) and a SX6036 switch. I just got these and have no idea what I am doing yet.. All servers also have dual 10G SPF Nics that are connected to a switch for regular ethernet

and my workstation that has a threadripper 5995wx, 1TB Ram, and 4x 3090s (will be upgraded to 5090s when they drop). It is running windows and WSL (also dual booted to Ubuntu 22.04 due to a bug with WSL and 4 GPUs)

I have a large dataset taking up 70% of the 500TBs from commoncrawl. I was thinking K8s with the r420 as the master and 630s as worker nodes. I also might throw the 4140 and the 730 in the cluster too. I currently have Minio on a docker image on the 730 but I think it is slow for what I am trying to do, therefore I was going to move it to the K8s cluster but I only have 1 chassis for the drives. I see all this other technology (Hadoop, Spark, Minio, etc). I am doing this to learn primarily. The only way I really learn is hands on. My goal is to try to replicate what the big guys do, at a much smaller scale, but learning the technologies that I will need if I want to shift into this field. So given this layout, wanting to be able to build models and use the hardware as efficiently as possible (meaning if I am preprocessing, all CPUs are at full tilt until its done, if I am training all GPUs are at full tilt until its done) and storage access is as fast as I can make it, how would you configure this?

Also, if there is something I need to buy that is inexpensive to make this much better, I am open to suggestions.

edit:

I also need the dataset externally accessible (that is why I am using Minio)

tl;dr:

given this equipment, and the workload (also being a home lab) how would you configure it? Do i bring in the 730 into the cluster, or set it up as a trunas/unraid setup, or something else since I have 56GbE and IB(RDMA, RCoE)

submitted by /u/Professional_Lychee9
[link] [comments]

APC Rack Air Removal Unit compatibility with APC AR3300 Rack

Hello all

Does anybody know if the APC "Rack Air Removal Unit", Model = ACF102BLK is compatible with the APC AR3300?

I was able to find the datasheet for the ACF102BLK model from the official APC website, but there is nothing writen if they fit on the AR3300 rack model.

I have the strong feeling it should because of the dimensions but I just want to be sure, before i spend any money.

https://www.apc.com/ch/de/product/AR3300/netshelter-sx-geh%C3%A4use-42-he-600-mm-b-x-1200-mm-t-mit-schwarzen-seitenteilen/

https://www.apc.com/ch/de/product/ACF102BLK/apc-air-removal-unit-208-230-50-60hz/

Thank you

submitted by /u/SuperbValue4505
[link] [comments]

Dell Poweredge R720 and GY1TD NvME pci

I recently made some necessary updates to our lab by upgrading some of our older servers to handle storage.

I currently have 3 poweredge R720's on my rack and I wanted to use them specifically for Ceph storage handling.

I have installed the GY1TD card which has a PEX 8734 switch internally and can handle x4x4x4x4 bifurcation. I had also replaced the sas backplane with the necessary one to allow u.2 drives to work. All these parts are Dell parts and the drives light up and looks like they connect.

The problem is the following..

If I have the drives connected at boot, the boot process gets stuck at "initializing firmware".

If I remove the drives out of the caddy but I have the backplane and pic card connected then the server boots fine. But if I put the drives back in then the drive caddy lights up green and looks like it's doing something but I can't see the drive at all on the host. fdisk, blkid, lsblk nothing shows the drives.

I do not want to boot from these drives but I do want to use them strictly for storage on ceph as the poweredge servers have all been updated to 100Gb fiber links in-between the cluster.

I have also removed the perc card that was in the servers originally.

What can I do to make this card work ? I want to create an all flash ceph cluster and im having a real hard time with it.

lspci output below

04:00.0 Ethernet controller [0200]: Mellanox Technologies MT27520 Family [ConnectX-3 Pro] [15b3:1007] `Subsystem: Mellanox Technologies MT27520 Family [ConnectX-3 Pro] [15b3:0007]` `Kernel driver in use: mlx4_core` `Kernel modules: mlx4_core` 05:00.0 PCI bridge [0604]: PLX Technology, Inc. PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [10b5:8734] (rev ab) `Subsystem: Dell PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [1028:1f84]` `Kernel driver in use: pcieport` 06:04.0 PCI bridge [0604]: PLX Technology, Inc. PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [10b5:8734] (rev ab) `Subsystem: Dell PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [1028:1f84]` `Kernel driver in use: pcieport` 06:05.0 PCI bridge [0604]: PLX Technology, Inc. PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [10b5:8734] (rev ab) `Subsystem: Dell PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [1028:1f84]` `Kernel driver in use: pcieport` 06:06.0 PCI bridge [0604]: PLX Technology, Inc. PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [10b5:8734] (rev ab) `Subsystem: Dell PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [1028:1f84]` `Kernel driver in use: pcieport` 06:07.0 PCI bridge [0604]: PLX Technology, Inc. PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [10b5:8734] (rev ab) `Subsystem: Dell PEX 8734 32-lane, 8-Port PCI Express Gen 3 (8.0GT/s) Switch [1028:1f84]` `Kernel driver in use: pcieport` 0d:00.0 PCI bridge [0604]: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013] `Subsystem: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013]` `Kernel driver in use: pcieport` 0e:00.0 PCI bridge [0604]: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013] `Subsystem: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013]` `Kernel driver in use: pcieport` 0e:01.0 PCI bridge [0604]: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013] `Subsystem: Renesas Technology Corp. SH7757 PCIe Switch [PS] [1912:0013]` `Kernel driver in use: pcieport` 0f:00.0 PCI bridge [0604]: Renesas Technology Corp. SH7757 PCIe-PCI Bridge [PPB] [1912:0012] `Subsystem: Renesas Technology Corp. SH7757 PCIe-PCI Bridge [PPB] [1912:0012]` 10:00.0 VGA compatible controller [0300]: Matrox Electronics Systems Ltd. G200eR2 [102b:0534] `DeviceName: Embedded Video` `Subsystem: Dell G200eR2 [1028:048c]` `Kernel driver in use: mgag200` `Kernel modules: mgag200` 
submitted by /u/mtheimpaler
[link] [comments]

DIY TNSR hardware for 10k+ request per second?

1 Junio 2024 at 20:22

I download about 500tb of data per month using dual 1gbps connections and pfsense running on an old i7-3770k. I'm typically making 1k+ connections per second; 80% outbound get request, 20% inbound through tailscale tunnels from 10 budget VPS's.

I just upgraded my residential connection an 8gbps connection and am about two weeks out from adding another 8gbps connection. I have a combination of 10gb and 40gb connections between my servers.

Based on some reddit research I figured out that pfsense doesn't work well for 10gb L3 switching and that I need to migrate to TNSR or maybe Vyos(less preferred as I prefer GUI).

I'm trying to figure out what a decent setup would be based on my work load? I'm assuming like a xeon D1541 or any lga 3647 would be fine. Just not sure what is the best route to go, DIY 2U build or some dell/hpe setup which is hopefully cheap (less than $500). Any thoughts or suggestions?

p.s.Before anyone says anything, I have been downloading these large amounts of data for years out of my house and have never got a single warning message from an ISP. This server will be going into a sound deadening cabinet which i picked up for cheap and is where my 1.5pb of hdd and flash live, so ideally a 1U or 2U build to conserve space.

submitted by /u/9302462
[link] [comments]

Huawei Server Bios Password Reset.

Hello,

I have a Huawei RH2285 V2 rack server that I got from a friend. I added a bios password which I have forgotten and didn’t set up my access to Huawei’s management portal. How can I reset the Bios. I’ve tried removing the CMOS, jumping the BIOS-RCV pins and contacting Huawei which said I can’t get support unless I renew the device’s warranty. I can’t find any service manuals online. Any help would be greatly appreciated.

Thanks in advance

submitted by /u/CircuitMan8897
[link] [comments]

Server security

10 Mayo 2024 at 03:14

EDIT: I ditched Traefik, and Authentik. I am now using CloudFlare zero trust tunnels, closed all ports on my router and the attacks have completely stopped.

I recently posted about my server getting hundreds of requests and attacks, I followed through on some recommendations.

I ditched TrueNAS and went back to my Unraid Pro installation.

I’ve added JavaScript challenges through CloudFlare which has helped drop my traffic down to 200 from 20k per 24 hours. I set up Authelia, as well as CA Certs instead of Self Signed. HSTS. and a few other firewall rules for Trusted IPs.

I’m in the process of learning how to use crowdsec as another layer of protection. I’m looking for more recommendations. I don’t really like the feel of Authelia as the UI is rather huge lol for a login form.

The amount of attacks my router has detected since these changes have been 2 in the past day or two that is blocked.

submitted by /u/SpoofedXEX
[link] [comments]

Attacks on server seems excessive?

Follow up; After doing more digging. It looks like something or someone was able to actually inject a shell script into my traefik “app”. I resolved it, I will be switching to a different ingress system. I have been looking into using portainer to spin up docker images.

So, I self host using TrueNAS Scale and I have 12 "apps" that run constantly.

bookstack
hastebin
maintainerr
ollama
overseerr
plex
radarr
sabnzbd
sonarr
tautulli
tdarr
traefik

I've never noticed anything out of the ordinary other than cloudflare showing I have on average 19k requests per 24 hours for services I pretty much use. I know bots will account for a lot of these once a domain is cached on Google and gets picked up on scanning etc.

I checked my router, it shows that every day, every hour for the last 3 months there has been a "web shell script" attack blocked. I checked my servers logs and still see nothing out of the ordinary, I feel like it is a bit excessive to be this much.

Of the 12 apps, 8 are forward facing to the internet and passed through cloudflare on specific use domains. Served with Full end-to-end SSL certs.

Just paranoid.

Edited; Accidentally put month in place of 24 hour measurement.

submitted by /u/SpoofedXEX
[link] [comments]

Help for network configuration

5 Mayo 2024 at 13:22

Hello,

I need some help on the network, let me explain, I have a pool of public IPs, I want to assign these IPs to VMs, without doing port forwarding (which I currently do), I would like each VM to have directly the public IP that is assigned on their network card.

In terms of infrastructure, I have a Fortigate 60F, a ubiquiti 48 PRO switch, and the hypervisor is vSphere 8.

Thanks in advance for your help

submitted by /u/TyZen6
[link] [comments]

Any ideas to rent my servers?

Hi mates!

I have lot of space and I am considering the idea of set up some micro data centers. In this industry we are competing with the big data centers that are offering affordable solutions. But maybe is a commercial niche for the small ones? Like offering the resources or services like image generator, LLM for specific niche..? And where I offer these services? Or just email some AI startups? What do you think? Any ideas? Thank you so much in advanced

submitted by /u/unicorn_startup77
[link] [comments]

Storage Server

22 Abril 2024 at 20:11

I'm trying to buy a storage server. I have a lot of data collected over the years and have been using USB drives and a Synology NAS for storage and backup. The primary use will be storage/backup (likely TrueNAS), but it will also be used as a media server (movies, TV, music, audiobooks, ebooks, comics, etc.). And I've recently started getting into self-hosting, so I'm thinking about loading it with Proxmox and running TrueNAS on top of that, for limited other uses.

There are some Supermicros I've found in my price range and seem to have what I need. But I'm having trouble finding good information about how to go forward. For example, I'd need some sort of graphics capability and I have my doubts that I could fit a full-size graphics card into most storage serves. And how do I gauge what I'd really need in the way of processors; Xeons are a different from what I'm used to. And what about keeping the power costs within reason? [sigh] I wish there was a pcpartpicker site for servers. I've done a ton of research, but I'm bad about missing what others find obvious. And most of what I do find is either way below what I need (say, a 2-drive NAS) or way above (enterprise). Are there any resources, sites, whatever that would help? Thanks.

submitted by /u/Pramathyus
[link] [comments]

My home datacenter

My home datacenter

3x R630z worth 256gb ram, and dual 2599 v4s R730 with 384gb ram and dual 2599 v3s R420 with...something.....no idea C4140 with 256gb ram and 4xP100s. And 640GB raw space (about half a petabyte of usble space)

Dual 20a 240v circuits

10g netoworking for servers and 1/2.5g for rest of the house

Ubiquit network and making some changes hence the spagetti crap.

I am an AI student and business owner. This is where the magic happens lol

submitted by /u/Professional_Lychee9
[link] [comments]

Need advice on electrical and maybe upgrade suggestions.

12 Abril 2024 at 00:21

Hello! Long time lurker at r/homelabs and r/selfhosted, and now here! I’ll be starting my journey from average pc builder to average homelaber soon.

The plan is to eventually put a small rack to my office closet. I’m not exactly sure what I’ll be running or hosting, but it will probably be home to my home built NAS, a bout a dozen mini pc’s, my plex server, a few game servers, etc. I’ll also be relocating my modem to this closet and will be adding 2.5gb switch to serve the home. I also plan to add a UPS at some point.

I need an outlet or two added to this closet in my home office. Currently there are none. So I’m wondering do we stick with a 15amp breaker, or do I need bigger like a 20 or 30? Or is it better I split the load between say two 15amps? Luckily the Main Breaker is going to be about 10 feet away so cost probably won’t be a big issue. I just don’t know how much stuff like this will draw and I wanna be sure it’s enough. (Live in the US btw)

I’m aware that closets are sometimes a bad choice. This one is 6x8x8, and does have duct work leading into it. I live in AZ so it will get decent cooling and I’ll close the vent for our “winter”. I’m considering a passive vent added to the bottom of the closet door, and a basic exhaust fan into the attic space above as well. But maybe only thermal regulated..

Any suggestions or tips for these things, or maybe things you guys would have done differently. Wanna start this journey out on a decent foundation.

Thank you for looking!

submitted by /u/nadun29
[link] [comments]
❌
❌