首页 文章

Kubernetes服务与kafka无法正常工作

提问于
浏览
0

我正在尝试将zookeeper和kafka设置为共享命名空间中的单独Kubernetes部署/ pod . 我在我的Ubuntu沙箱上通过kubeadm引导了一个本地K8s 1.8和Calico ......

对于Zookeeper,我正在使用图像zookeeper:3.4来自hub.docker.com,我创建了一个Kubernetes部署和服务,我公开了端口:2181 2888 3888.服务名称是zookeeper,我认为我应该可以使用它来自命名空间中pod的主机名 .

对于Kafka 1.0,我创建了自己的容器映像,我可以使用环境变量控制...我将zookeeper.connect设置为zookeeper:2181 . 我假设Kubernetes DNS将解决此问题并打开与服务的连接 .

不幸的是我得到:

[2018-01-03 15:48:26,292] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
[2018-01-03 15:48:32,293] INFO Terminate ZkClient event thread. (org.I0Itec.zkclient.ZkEventThread)
[2018-01-03 15:48:46,286] INFO Opening socket connection to server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,299] INFO Socket connection established to zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,319] INFO Session establishment complete on server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, sessionid = 0x10000603c560001, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,331] INFO Session: 0x10000603c560001 closed (org.apache.zookeeper.ZooKeeper)
[2018-01-03 15:48:46,333] FATAL Fatal error during KafkaServer startup. Prepare to shutdown (kafka.server.KafkaServer)
org.I0Itec.zkclient.exception.ZkTimeoutException: Unable to connect to zookeeper server 'zookeeper:2181' with timeout of 6000 ms
    at org.I0Itec.zkclient.ZkClient.connect(ZkClient.java:1233)
    at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:157)
    at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:131)
    at kafka.utils.ZkUtils$.createZkClientAndConnection(ZkUtils.scala:115)
    at kafka.utils.ZkUtils$.withMetrics(ZkUtils.scala:92)
    at kafka.server.KafkaServer.initZk(KafkaServer.scala:346)
    at kafka.server.KafkaServer.startup(KafkaServer.scala:194)
    at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38)
    at kafka.Kafka$.main(Kafka.scala:92)
    at kafka.Kafka.main(Kafka.scala)

所以我假设我的群集中有一个通用的网络问题,然后我注意到一些让我更加困惑的事情...如果我将zookeeper.connect设置为10.107.41.148:2181(zookeeper服务的当前地址),则连接工作(至少从 Kafka 到动物园管理员) .

[2018-01-03 15:51:31,092] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
[2018-01-03 15:51:31,094] INFO Opening socket connection to server 10.107.41.148/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:51:31,105] INFO Socket connection established to 10.107.41.148/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:51:31,134] INFO Session establishment complete on server 10.107.41.148/10.107.41.148:2181, sessionid = 0x10000603c560005, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)

通过这种设置,我可以使用来自kubernetes集群主机的zookeeper服务来做例如“bin / kafka-topics.sh --list --zookeeper 10.107.41.148:2181” . 制作一条消息不起作用......我假设一旦网络正常工作,我需要添加kafka广告的地址...

kafka-console-producer.sh --broker-list 10.100.117.196:9092 --topic test1
>test-msg1
>[2018-01-03 17:05:35,689] WARN [Producer clientId=console-producer] Connection to node 0 could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient)

任何提示我的Kubernetes网络设置有什么问题或者至少在哪里开始故障排除?

谢谢你,最诚挚的问候,帕维尔

1 回答

  • 0

    如果使用状态集,则需要先部署服务 .

    在这里我的服务 .

    apiVersion: v1
    kind: Service
    metadata:
      name: zookeeper
      labels:
        app: zookeeper
    spec:
      clusterIP: None
      ports:
      - port: 2181
        name: client
      - port: 2888
        name: server
      - port: 3888
        name: leader-election
      selector:
        app: zookeeper
    

    这里是configmap(稍后使用):

    apiVersion: v1
        kind: ConfigMap
        metadata:
          name: zookeeper-cm
        data:
          jvm.heap: "1G"
          tick: "2000"
          init: "10"
          sync: "5"
          client.cnxns: "60"
          snap.retain: "3"
          purge.interval: "0"
    

    在这里我的状态:

    apiVersion: apps/v1beta1
    kind: StatefulSet
    metadata:
      name: zookeeper
    spec:
      serviceName: zookeeper
      replicas: 1
      template:
        metadata:
          labels:
            app: zookeeper
        spec:
          affinity:
            podAntiAffinity:
              requiredDuringSchedulingIgnoredDuringExecution:
                - labelSelector:
                    matchExpressions:
                      - key: "app"
                        operator: In
                        values:
                        - zookeeper
                  topologyKey: "kubernetes.io/hostname"
          containers:
          - name: zookeeper
            imagePullPolicy: IfNotPresent
            image: sorintdev/zookeeper:20171204m
            resources:
              requests:
                memory: 500Mi
                cpu: 200m
            ports:
            - containerPort: 2181
              name: client
            - containerPort: 2888
              name: server
            - containerPort: 3888
              name: leader-election
            env:
            - name : ZK_REPLICAS
              value: "1"
            - name : ZK_HEAP_SIZE
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: jvm.heap
            - name : ZK_TICK_TIME
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: tick
            - name : ZK_INIT_LIMIT
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: init
            - name : ZK_SYNC_LIMIT
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: tick
            - name : ZK_MAX_CLIENT_CNXNS
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: client.cnxns
            - name: ZK_SNAP_RETAIN_COUNT
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: snap.retain
            - name: ZK_PURGE_INTERVAL
              valueFrom:
                configMapKeyRef:
                    name: zookeeper-cm
                    key: purge.interval
            - name: ZK_CLIENT_PORT
              value: "2181"
            - name: ZK_SERVER_PORT
              value: "2888"
            - name: ZK_ELECTION_PORT
              value: "3888"
            command:
            - bash
            - -c
            - zkGenConfig.sh && zkServer.sh start-foreground
            readinessProbe:
              exec:
                command:
                - "zkOk.sh"
              initialDelaySeconds: 10
              timeoutSeconds: 5
            livenessProbe:
              exec:
                command:
                - "zkOk.sh"
              initialDelaySeconds: 10
              timeoutSeconds: 5
            volumeMounts:
            - name: data
              mountPath: /var/lib/zookeeper
          securityContext:
            runAsUser: 1000
            fsGroup: 1000
      volumeClaimTemplates:
      - metadata:
          name: data
        spec:
          resources:
            requests:
              storage: 1Gi
          accessModes:
          - ReadWriteOnce
          storageClassName: zookeeper-class
    

    部署了正在运行的zookeeper配置并且他们选择了主服务器后,您可以继续进行kafka部署 .

    部署zookeeper后,您的kafka配置必须通过该服务引用zookeeper statefulset . 在kafka中,您必须定义/覆盖此属性:

    --override zookeeper.connect=zookeeper-0.zookeeper:2181
    

    从一个吊舱内你应该成功地做到了.963649_ .

    希望这可以帮助 .

相关问题